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TRANSCYTOSIS OF ADENO- ASSOCIATED VIRUSES 



CROSS-REFERENCE TO RELATED APPLICATIONS 

This claims the benefit of U.S. Provisional Application No. 60/607,854, entitled 
"Transcytosis of Adeno-Associated Viruses", filed September 8, 2004, by Chiorini et al, 
1 0 which is herein incorporated by reference in its entirety. 

BACKGROUND 

The adeno-associated viruses (AAV) were originally classified according to size, 
structure, and dependence upon a helper virus for replication. AAV is a member of the 
Parvoviridae, a virus family characterized by a single stranded linear DNA genome and a 

15 small icosahedral shaped capsid measuring about 20nm in diameter. AAV was first 

described as a contaminant of tissue culture grown simian virus 15, a simian adeno virus 
and was found dependent on adenovirus for measurable replication. This led to its name, 
adeno-associated virus, and its classification in the genus Dependovirus. Because the 
majority of AAV isolates were first identified as contaminants of laboratory stocks of 

20 adenovirus, little is known about their natural tissue tropism. However in vivo experiments 
suggest they are effective vectors for gene transfer applications. Currently eleven full-length 
isolates have been cloned and their initial characterization indicates that each serotype has 
unique binding/cell tropism characteristics. 

Transcytosis is the transport of macromolecular cargo from one side of a cell to the 

25 other within membrane-bounded carrier(s). It is a strategy used by multicellular organisms 
to selectively move material between two different environments while maintaining the 
distinct compositions of those environments. The ability of a pathogen to spread through a 
tissue is a critical determinate of its virulence. The process of transcytosis has been reported 
for a number of viruses. For example, HTV and poliovirus cross simple epithelial cells 

30 without infection and are still infectious when they cross into the submucosa. Likewise, the 
Epstein-Barr virus (EBV) forms a complex with mucosal immimoglobulins (IgA) that are 
specific for gp350, a viral surface protein that is present in latently infected people. This 
complex binds to the poly-immunoglobulin receptor at the basal surface of epithelial cells, 
and is endocytosed and delivered apically without infection. To date, there is no report of 

35 transcytosis by any AAV. 
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5 

Provided herein are methods for transcytosis across barrier epithelial cells using 
AAV vectors. The ability of a non-pathogenic vector to transcytose barrier epithelial cells 
can be used to deliver genes to sub-epithelial targets. One important example includes the 
delivery of genes across the blood-brain-barrier without the need for direct injection into the 
10 brain. Furthermore, herein is described a method for re-directing virus that enters a cell by 
transcytosis to result in transduction of the cell by blocking exocytosis. 

SUMMARY 

In accordance with the purpose(s) of this invention, as embodied and broadly 
described herein, this invention, in one aspect, relates to a method of delivering a 
15 heterologous nucleic acid across an epithelial barrier comprising delivering to the epithelial 
barrier an AAV vector comprising the heterologous nucleic acid. The epithelial cells can be 
in the gut, lung, genitourinary tract, kidney, blood vessels or brain. 

In another aspect, the invention relates to a method of transcytosing epithelial cells 
of a human subject comprising administering to the subject a viral vector comprising a 
20 heterologous nucleic acid, wherein the viral vector is selected from a group consisting of 
BAAV, AAV4, AAV5, or AAV7. 

Additional advantages of the invention will be set forth in part in the description 
which follows, and in part will be obvious from the description, or may be learned by 
practice of the invention. The advantages of the invention will be realized and attained by 
25 means of the elements and combinations particularly pointed out in the appended claims. It 
is to be understood that both the foregoing general description and the following detailed 
description are exemplary and explanatory only and are not restrictive of the invention, as 
claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 The accompanying drawings, which are incorporated in and constitute a part of this 

specification, illustrate (one) several embodiment(s) of the invention and together with the 
description, serve to explain the principles of the invention. 

Figure 1 shows that AAV4 transcytosed in CaCo-2, MDCKI, MDCKH, Human 
primary immortalized epithelial endometrial, Bovine brain primary endothelia cells (BBB). 

35 AAV5 transcytosed CaCo-2 cells, whereas BAAV transcytosed in MDCKs, Endometrial, 
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5 airways epithelia, and BBB. AAV6 did not transcytose in any of cell types tested. Hela cells 
do not form barrier epithelia and were used as a control. 

Figure 2 shows that the treatment of the basal lateral surface of Human primary 
airways epithelial cell (HAE) with tannic acid blocked the transcytosis of BAAV vector 
containing a GFP expression cassette from the apical surface to the basal lateral. 
10 Furthermore transduction dramatically increased when assayed at 24 hrs post inoculation. 
In contrast no change was observed in AAV2 transduction, which did not demonstrate any 
transcytosis activity and has limited binding activity on HAE. 

Figure 3 shows AAV7 transcytosis assay on bovine brain endothelial cells. Virus 
DNA extracted from basal lateral medium after 3H incubation 2xl0 9 DRP of AAV were 
15 loaded on the apical side of the cell layer. AAV5 is used as a control. 

DETAILED DESCRIPTION 

The present invention may be understood more readily by reference to the following 
detailed description of the invention and the Examples included therein and to the Figures 
and their previous and following description. 

20 Before the present compounds, compositions, articles, devices, and/or methods are 

disclosed and described, it is to be understood that this invention is not limited to specific 
synthetic methods, specific cell types, or to particular tissues, as such may, of course, vary. 
It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only and is not intended to be limiting. 

25 As used in the specification and the appended claims, the singular forms "a," "an" 

and "the" include plural referents unless the context clearly dictates otherwise. Thus, for 
example, reference to "a pharmaceutical carrier" includes mixtures of two or more such 
carriers, and the like. 

Ranges may be expressed herein as from "about" one particular value, and/or to 

30 "about" another particular value. When such a range is expressed, another embodiment 
includes from the one particular value and/or to the other particular value. Similarly, when 
values are expressed as approximations, by use of the antecedent "about," it will be 
understood that the particular value forms another embodiment. It will be further understood 
that the endpoints of each of the ranges are significant both in relation to the other endpoint, 

35 and independently of the other endpoint. 
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5 "Optional" or "optionally" as used herein means that the subsequently described 

event or circumstance may or may not occur, and that the description includes instances 
where said event or circumstance occurs and instances where it does not. 

AAV Transcytosis 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

1 0 barrier comprising delivering to the epithelial barrier an AAV vector comprising the 

heterologous nucleic acid. In one aspect of the method, the AAV is AAV4, AAV5, AAV7, 
or BAAV. The AAV capsid protein forming the viral particle is understood herein to confer 
upon the AAV particle the desired transcytosing ability. Thus, "AAV vector", as used 
herein, refers to any virion, vector, or viral particle comprising or encoding at least one 

1 5 AAV capsid protein. As an example, an AAV4 vector can encode an AAV4 capsid protein 
and thus be encapsidated in said protein forming an AAV4 particle. Alternatively the AAV 
vector can comprise a nucleic acid encoding a modified AAV or a portion of an AAV capsid 
protein (a capsid protein fragment) that confers serotype-specific trancytotic activity. AAV 
capsids, capsid protein fragments and capsid modifications are disclosed, for example, in 

20 U.S. Patent Application No. 60/526786 (BAAV), U.S. Patent No. 6,468,524 (AAV4), U.S. 
Patent Application No. 09/717,789 (AAV5), U.S. Patent Application 2003/0228282 
(AAV7), International Application No. PCT/US04/15534, filed May 19, 2004 (AAAV), and 
U.S. Patent Application No. 60/676604, filed April 29, 2005 (AAV-X1, AAV-Xlb, AAV- 
X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, AAV-X26). 

25 In another aspect of the method, the epithelial cells are in the gut, lung, genitourinary 

tract, kidney, blood vessels or brain. In another aspect of the method, the epithelial cells can 
be selected from a group consisting of bronchial, alveolar, tracheal or upper airway 
epithelial cells; absorptive enterocytes or M cells; endometrial or urinary epithelial cells; 
renal collecting duct or proximal tubule epithelial cells; cerebral microvascular endothelial 

30 cells or Choroidal Plexus epithelial cells. 

Further disclosed is a method of transcytosing epithelial cells of a human subject 
comprising administering to the subject an AAV vector comprising a heterologous nucleic 
acid. In one aspect of the method, the vector is AAV4, AAV5, AAV7, or BAAV. In another 
aspect of the method, the epithelial cells are selected from a group consisting of bronchial, 

35 alveolar, tracheal or upper airway epithelial cells; absorptive enterocytes or M cells; 
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5 endometrial or urinary epithelial cells; renal collecting duct or proximal tubule epithelial 
cells; cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells. 

Further contemplated are methods for the delivery of molecules across epithelial cell 
barriers comprising coupling the molecules to non-recombinant (wild-type) AAV capsids or 
particles. In one aspect, the molecules are radioligands or enzymes. 

10 The term "adeno-associated virus (AAV)" is used herein to refer to a genus of 

viruses in the family Parvoviridae which are all defective viruses (unable to replicate by 
themselves) and depend on the co-infection of their host cell by other, nondefective viruses 
to help them replicate. 

Transcytosis refers to the transport of macromolecular cargo from one side of a cell 

15 to the other, generally within a membrane-bounded carriers). Tuma and Hubbard provided 
a review of transcytosis (Tuma PL and Hubbard AL. 2003. Physiol Rev. 83:871-932), herein 
incorporated by reference for its teaching regarding the nature and uses for trancytosis. 
Transcytosis is a strategy used by multicellular organisms to selectively move material 
between two different environments while maintaining the distinct compositions of those 

20 environments. N. Simionescu was the first to coin the term transcytosis to describe the 
vectorial transfer of macromolecular cargo within the plasmalemmal vesicles from the 
circulation across capillary endothelial cells to the interstitium of tissues. During this same 
period, another type of transcytosis was being discovered. Immunologists comparing the 
different types of immunoglobulins found in various secretions (e.g., serum, milk, saliva, 

25 and the intestinal lumen) speculated that the form of IgA found in external secretions (called 
secretory IgA, due to the presence of an additional protein component) was selectively 
transported across the epithelial cell barrier. More is known about transcytosis as it is 
expressed in epithelial tissues, which form cellular barriers between two environments. In 
this polarized cell type, net movement of material can be in either direction, apical to 

30 basolateral or the reverse, depending on the cargo and particular cellular context of the 
process. However, transcytosis is not restricted to only epithelial cells. 

Since the 19th century dye experiments of Ehrlich, the brain has been known as a 
privileged" organ where access is tightly regulated so that the environment remains 
chemically stable. The two principal gatekeepers of the brain are the cerebral capillary 

35 endothelium and the cuboidal epithelial cells of the choroid plexus. These cellular barriers 
are specialized for the passage of different nutrients from the blood. The capillaries move 
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5 nutrients that are required rapidly and in large quantities, such as glucose and amino acids. 
These small molecules are transported by membrane carriers using facilitated diffusion. The 
choroid plexus supplies nutrients that are required less acutely and in lower quantities. 
These are folate and other vitamins, ascorbate, and deoxyribonucleotides. 

There are two epithelial cells that participate in transcytosis in the intestine, M cells 

10 and enterocytes (adsorptive columnar cells). These cells are very different from one another 
and the capillary endothelial cell. Depending on the species, M cells comprise a variable but 
small percentage of the epithelia overlying organized mucosal-associated lymphoid tissue, 
making them a very minor cell population in the gastrointestinal tract. The transcytotic 
route across M cells is thought to be part of the mechanism by which antigens are routinely 

15 sampled along the entire mucosal surface. Not surprisingly, numerous pathogens have 
evolved mechanisms to exploit the transcytotic process as a means to invade and 
disseminate before a strong enough immune response can be mounted. 

Absorptive enterocytes are simple columnar cells with several apical features in 
addition to their brush borders. Clathrin-coated pits are present at the base of microvilli, and 

20 a thick glycocalyx composed of integral membrane proteins with glycosaminoglycan side 
chains emanates from the microvillar membrane. This latter structural feature as well as the 
rigidity of the microvilli are thought to prohibit microorganisms from attaching and 
invading enterocytes. The intracellular organization of these columnar epithelial cells is also 
polarized, with basally located nuclei, supranuclear Golgi, and an abundance of 

25 pleomorphic membrane compartments underlying the terminal web of the brush border. 
The basolateral-to-apical length of this cell is -20 versus 0.2 fim for a capillary endothelial 
cell, making the transcytotic route across enterocytes potentially much longer. Furthermore, 
microtubules are an important structural element of the transcytotic pathway in enterocytes, 
but not in M or endothelial cells. 

30 Transcytosis also occurs in the upper regions of the respiratory tract and has been 

demonstrated with two vector systems, plgA-R and FcRn, but others could exist. Secretory 
IgA is a known constituent of the lung's immune defense system, with bronchial epithelial 
cells carrying out basolateral-to-apical transport of dlgA, which is secreted by local plasma 
cells in underlying lymphoid tissue. Albumin, which is found in lung fluid, is endocytosed 

35 specifically at the apical surface of airway epithelia but is then subsequently degraded. At 
the alveolar level, the question of whether albumin is transcytosed intact is uncertain. 
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5 The methods and compositions described herein can be used to deliver heterologous 

nucleic acids to certain tissues. As used herein, the term "nucleic acid" refers to single-or 
multiple stranded molecules which may be DNA or RNA, or any combination thereof, 
including modifications to those nucleic acids. The nucleic acid may represent a coding 
strand or its complement, or any combination thereof. Nucleic acids may be identical in 

10 sequence to the sequences which are naturally occurring for any of the novel genes 

discussed herein or may include alternative codons which encode the same amino acid as 
those provided herein, including that which is found in the naturally occurring sequence. 
These nucleic acids can also be modified from their typical structure. Such modifications 
include, but are not limited to, methylated nucleic acids, the substitution of a non-bridging 

15 oxygen on the phosphate residue with either a sulfur (yielding phosphorothioate 

deoxynucleotides), selenium (yielding phosphorselenoate deoxynucleotides), or methyl 
groups (yielding methylphosphonate deoxynucleotides). 

As used herein, the term "isolated" refers to a nucleic acid separated or significantly 
free from at least some of the other components of the naturally occurring organism, for 

20 example, the cell structural components or viral components commonly found associated 
with nucleic acids in the environment of the virus and/or other nucleic acids. The isolation 
of the native nucleic acids can be accomplished, for example, by techniques such as cell 
lysis followed by phenol plus chloroform extraction, followed by ethanol precipitation of the 
nucleic acids. The nucleic acids of this invention can be isolated from cells according to any 

25 of many methods well known in the art. 

The AAV vectors disclose herein can comprise a heterologous nucleic acid 
functionally linked to the promoter. The term "heterologous" is used herein to refer to a 
nucleic acid which is derived from a different cell, tissue or organism. The nucleic acid can 
encode a polypeptide or protein or an antisense RNA, for example. By "functionally linked" 

30 is meant such that the promoter can promote expression of the heterologous nucleic acid, as 
is known in the art, such as appropriate orientation of the promoter relative to the 
heterologous nucleic acid. Furthermore, the heterologous nucleic acid preferably has all 
appropriate sequences for expression of the nucleic acid, as known in the art, to functionally 
encode, i.e., allow the nucleic acid to be expressed. The nucleic acid can include, for 

35 example, expression control sequences, such as an enhancer, and necessary information 
processing sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, 
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5 and transcriptional terminator sequences. 

The heterologous nucleic acid can encode beneficial proteins that replace missing or 
defective proteins required by the subject into which the vector in transferred or can encode 
a cytotoxic polypeptide that can be directed, e.g., to cancer cells or other cells whose death 
would be beneficial to the subject. The heterologous nucleic acid can also encode antisense 

10 RNAs that can bind to, and thereby inactivate, mRNAs made by the subject that encode 
harmful proteins. In one embodiment, antisense polynucleotides can be produced from a 
heterologous expression cassette in an AAV4 viral construct where the expression cassette 
contains a sequence that promotes cell-type specific expression (Wirak et al., 1991. EMBO 
10:289). For general methods relating to antisense polynucleotides, see Antisense RNA and 

15 DNA, D. A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988). 
Examples of heterologous nucleic acids which can be administered to a cell or 
subject as part of the present AAV4 vector can include, but are not limited to the following: 
nucleic acids encoding therapeutic agents, such as tumor necrosis factors (TNF), such as 
TNF-a; interferons, such as interferon-o; interferon-^ and interferon-y, interleukins, such as 

20 IL-1, IL-lft and ILs -2 through -14; GM-CSF; adenosine deaminase; cellular growth factors, 
such as lymphokines; soluble CD4; Factor VHI; Factor DC; T-cell receptors; LDL receptor; 
ApoE; ApoC; alpha- 1 antitrypsin; ornithine transcarbamylase (OTC); cystic fibrosis 
transmembrane receptor (CFTR); insulin; Fc receptors for antigen binding domains of 
antibodies, such as immunoglobulins; and antisense sequences which inhibit viral 

25 replication, such as antisense sequences which inhibit replication of hepatitis B or hepatitis 
non-A, non-B virus. The nucleic acid is chosen considering several factors, including the 
cell to be transfected. Where the target cell is a blood cell, for example, particularly useful 
nucleic acids to use are those which allow the blood cells to exert a therapeutic effect, such 
as a gene encoding a clotting factor for use in treatment of hemophilia. Furthermore, the 

30 nucleic acid can encode more than one gene product, limited only, if the nucleic acid is to be 
packaged in a capsid, by the size of nucleic acid that can be packaged. 

The term "polypeptide" as used herein refers to a polymer of amino acids and 
includes full-length proteins and fragments thereof. Thus, "protein," polypeptide," and 
"peptide" are often used interchangeably herein. Substitutions can be selected by known 

35 parameters to be neutral {see, e.g., Robinson WE Jr, and Mitchell WM., 1990. AIDS 

4:S151-S162). As will be appreciated by those skilled in the art, the invention also includes 
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5 those polypeptides having slight variations in amino acid sequences or other properties. 
Such variations may arise naturally as allelic variations (e.g., due to genetic polymorphism) 
or may be produced by human intervention (e.g„ by mutagenesis of cloned DNA 
sequences), such as induced point, deletion, insertion and substitution mutants. Minor 
changes in amino acid sequence are generally preferred, such as conservative amino acid 

10 replacements, small internal deletions or insertions, and additions or deletions at the ends of 
the molecules. Substitutions maybe designed based on, for example, the model of Dayhoff, 
et al (in Atlas of Protein Sequence and Structure 1978, Nat'l Biomed. Res. Found., 
Washington, D.C.). These modifications can result in changes in the amino acid sequence, 
provide silent mutations, modify a restriction site, or provide other specific mutations. 

15 The term "epithelia" is used herein to refer to cells which are linked tightly together 

by intercellular junctions to form a planar sheet. These sheets of cells form a barrier between 
two compartments. Epithelia therefore line all surfaces and cavities (including skin, 
peritoneum, linings of the intestine, airways, genitourinary tracts, glands, and blood vessels. 
An epithelium has a free or apical surface facing the environment, or lumen of a 

20 cavity, and a basal surface facing the underlying connective tissue. The boundary between 
the basal surface of an epithelium and the underlying connective tissue is usually very sharp, 
and is the site where the basal lamina (BL) is present. Most BL are too thin to be seen with 
the light microscope. However, the BL, together with a thin layer of connective tissue, is 
often times seen at the epitheUal/connective tissue interface. This composite layer, visible 

25 with the light microscope, was initially called the Basement Membrane. Application of the 
electron microscope revealed that, in most cases, this Basement Membrane actually 
consisted of the true basal lamina (lamina lucida plus lamina densa), along with a layer of 
adherent connective tissue. 

For convenience of description, epithelia are classified into different types based on 

30 the number of cell layers and the cell shape. 

Epithelia which are 1 cell layer thick are called "simple" epithelia. Thus, each cell 
rests on the basal lamina, but also has a surface facing the lumen/outside world. Epithelia 
which are 2 or more cell layers thick are called "stratified" epithelia. In stratified epithelia, 
the basal layer of cells rests on the basal lamina, but subsequent layers do not, and are 

35 simply stacked on top of the basal layer. The cells of the most superficial layer have a free 
surface, "squamous" cells are very flat, like a fried egg, where the yolk is the nucleus. The 
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5 nucleus is distinctly flattened, the cell is often so thin that this flattened nucleus bulges the 
cell surface outward, "cuboidal" cells range from true cuboidal where the cell is about as 
high as it is wide, to a flattened cuboidal where the cell is wider than high. In cuboidal cells 
the nucleus is usually round, and not flattened as in squamous, "columnar" cells are 2 or 
more times as high as wide. Nucleus is usually elongated in the long axis of the cell. 

1 0 Squamous cells form the lining of cavities such as the mouth, blood vessels, heart 

and lungs and make up the outer layers of the skin. Cuboidal epithelium is found in glands 
and in the lining of the kidney tubules as well as in the ducts of the glands. They also 
constitute the germinal epithelium which produces the egg cells in the female ovary and the 
sperm cells in the male testes. Columnar epithelium forms the lining of the stomach and 

1 5 intestines. Some columnar cells are specialized for sensory reception such as in the nose, 
ears and the taste buds of the tongue. 

Ciliated columnar epithelial cells posses fine hair-like outgrowths, cilia on their free 
surfaces. These cilia are capable of rapid, rhythmic, wavelike beatings in a certain direction. 
Ciliated epithelium is usually found in the air passages like the nose. It is also found in the 

20 uterus and Fallopian tubes of females. 

Columnar epithelium with goblet cells is called glandular epithelium. Some parts of 
the glandular epithelium consist of such a large number of goblet cells that there are only a 
few normal epithelial cells left. Columnar and cuboidal epithelial cells often become 
specialized as gland cells which are capable of synthesizing and secreting certain substances 

25 such as enzymes, hormones, milk, mucus, sweat, wax and saliva. Unicellular glands consist 
of single, isolated glandular cells such as the goblet cells. Sometimes a portion of the 
epithelial tissue becomes invaginated and a multicellular gland is formed. Multicellular 
glands are composed of clusters of cells. Most glands are multicellular including the salivary 
glands. 

30 Where body linings have to withstand wear and tear, the epithelia are composed of 

several layers of cells and are then called compound or stratified epithelium. The top cells 
are flat and scaly and it may or may not be keratinized (i.e. containing a tough, resistant 
protein called keratin). The mammalian skin is an example of dry, keratinized, stratified 
epithelium. The lining of the mouth cavity is an example of an unkeratinized, stratified 

35 epithelium. 
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5 In vitro Cell Models of Transcytosis 

The use of in vitro cell models to study transcytosis has many advantages over in 
vivo systems. First, variation among animals is eliminated, as is the confounding issue of 
cargo possibly being modified or endocytosed by cell types other than the one under study. 
Moreover, in vitro systems can be manipulated in ways not possible in vivo, allowing 

10 investigators to measure the effects of different variables (e.g., temperatures, 
pharmacological agents, etc.) with greater precision and to explore the molecular 
mechanisms of transcytosis. 

The integrity of the monolayer is obviously vital to every study of transcytosis, and 
there are different methods for assessing it. Transepithelial electrical resistance (TER) 

1 5 measurements are commonly used as an indication of tight junction integrity in a monolayer, 
and commercial instruments are available for these measurements. 

Caco-2 cells, human primary colon carcinoma cells, are a well studied model of 
intestinal absorptive enterocytes. They are the most commonly used intestinal cell line 
because they differentiate furthest along the cryptto-villus axis and are the easiest to 

20 transfect. Caco-2 cells have been especially used to model transcytosis of bacteria, which 
can cross barrier epithelia in (he gut and brain (Zhang JR, et al., 2000. Cell 102(6):827-37), 
incorporated herein by reference. 

There is little evidence for in vivo transcytosis of macromolecular cargo in kidney. 
Nonetheless, MDCK cells, which are derived from dog kidney, are the most-studied 

25 epithelial cell model and have been used extensively to study transcytosis. These cells were 
originally developed by nephrologists for permeability and electrical studies. Their 
subsequent use by cell biologists for studies of the formation of tight junctions, 
establishment of polarity, and vesicle traffic have popularized MDCK cells. An advantage is 
that MDCK cells are easily cultured, easily transfected, and become polarized 3-5 days after 

30 seeding. They were used in the now classical studies showing that enveloped viruses bud in 
a polarized fashion and that the newly synthesized viral membrane glycoproteins are 
targeted directly from the TGN to the appropriate PM domain. Furthermore, much of the 
current understanding of the IgA transcytotic pathway and the sorting signals in the plgA-R 
comes from the elegant studies performed in MDCK cells. Two MDCK strains with very 

35 different features were identified some time ago. The MDCK I cell has a high TER and 
characteristics reminiscent of the renal collecting duct, whereas the more commonly used 
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5 MDCK n strain, whose TER is one order of magnitude lower than that of MDCK I cells, 

has phenotypic features closer to those of the renal proximal tubule. 

Both primary cells and cell lines, alone and in coculture with endothelial cells, are 

being used to study transcytosis in the lung. Clonetics bronchial/tracheal epithelial cell 

systems contain normal human bronchial/treacheal epithelial cells. This cell system has been 
10 used for experimental applications in cancer research, respiratory disease, cellular function 

and differentiation. 

The Clonetics® bovine Brain Microvascular Endothelial Cell System (bMVEC-B) is 
a model of the "Blood Brain Barrier". The system is designed to significantly improve a 
researcher's ability to study active and passive transport of drugs across the blood brain 

15 barrier, to study brain endothelial cell tight junctions, and to study the basic biology of brain 
microvascular endothelial cells (Schinket AH ,1999. Advanced Drug Delivery Reviews 
36:179-194; Tsukita S. et al., 1998. Molecular dissection of tight junctions:occluding and 
ZO-1 in Introduction to the Blood -Brain Barrier. Edited by William M Partridge; Inglis et 
al., 2004. Brain Research 998: 21 8-229), each of which is incorporated by reference for its 

20 teaching of in vitro endothelial cell modeling of the blood-brain barrier. 

Endometrial cells form an important barrier layer in the genitourinary tract. The 
cells used to model this system were developed by Kyo et al. and are derived from primary 
cells immortalized by the addition of the papillomiavirus E6/E7 genes and human 
telomerase reverse transcriptase. The isolated cells have a normal chromosomes and retain 

25 their responsiveness to sex-steriod hormones, exhibit glandular structure on three 

dimensional culture , and lack a transformed phenotype (Kyo S, et al. Am J Pathol., 2003. 
163(6):2259-69), incorporated herein by reference for its teaching of this endometrial 
model. 

Methods of Use 

30 The use of AAVs to deliver genes to the lung would be of benefit in genetic diseases 

like cystic fibrosis, pseudohypoaldosteronism, and immotile cilia syndrome. Furthermore, 
delivering genes to the lung would be of impact in several non-genetic diseases. For 
example, delivering genes that make antibiotic like peptides to the cells underlying the 
epithelia would be useful to prevent or treat bronchitis; delivering genes that make growth 

35 factors would be of value in common diseases like chronic bronchitis. Also, AAVs could be 
used to deliver genes that may play a role in asthma, like B>10, or antibodies to IgE and 
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5 interleukins. The use of an AAV vector to deliver genes through the alveolar epithelia 
would be of benefit in genetic diseases like alpha- 1 -antitrypsin deficiency. Furthermore, 
delivering genes through the alveolar epithelia would be of significance in several 
pulmonary non-genetic diseases. For example, delivering genes that make antibiotic like 
peptides would be useful to prevent or treat pneumonia (perhaps of antibiotic-resistant 

10 organisms); dehvering genes that make growth factors would be of value in emphysema; 
delivering genes thatover-express the epithelial sodium channel or the Na-K ATPase could 
be used to treat cardiogenic and non-cardiogenic pulmonary edema; delivering genes that 
have an anti-fibrosis effect like interferon for pulmonary fibrosis would also be useful. Also, 
AAVs could be used to deliver genes that may have a systemic effect like anti-hypertension 

1 5 drugs, insulin, coagulation factors, antibiotics, growth factors, hormones and others. 

The use of AAVs to deliver genes to the central nervous system (CNS)/ brain would 
be of benefit in neurological diseases, including Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder, panic disorder, learning disabilities, ALS, triplet expansions 

20 diseases, psychoses, autism, lysosomal storage diseases, Gaucher's disease, Hurler's 

disease, Krabbe's disease, battens disease, and altered behaviors (e.g., disorders in feeding, 
sleep patterns, balance, and perception). 

The use of AAVs to deliver genes to the gastrointestinal system/ gut would be of 
benefit in treatment of diseases and/or Gastrointestinal Disorders such as colon cancers, 

25 inflammatory bowel disease, diabetes, or Crohn's disease. 

The use of AAVs to deliver genes to the genitourinary system would be of benefit in 
treatment of diseases of the female reproductive tract, molecular defects in implantation 
disorders, and gynecological cancers. These methods would also have contraceptive 
applications. 

30 The use of AAVs to deliver genes to the kidney would be of benefit in treatment of 

inherited renal disorders such as polycystic kidney disease, Alport's syndrome, hereditary 

nephritis, primary hyperoxaluria, and cystinuria. 

The use of AAVs for wide-spread delivery of genes across blood vessels into the 

muscle would be of benefit in neuromuscular diseases like muscular dystrophy and 
35 Cardiovascular Disorders such as heart disease, restenosis, atherosclerosis, myocarditis, 

stoke, angina, or thrombosis. 
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5 The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 

tissues of a subject would be of benefit in the treatment of certain cancers (e.g., gastric, 
ovarian, lung, bladder, liver, and breast). 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 
tissues of a subject would be of benefit in the treatment of certain inflammatory disorders, 

10 including, but not limited to, adrenalitis, alveolitis, angiocholecystitis, appendicitis, 
balanitis, blepharitis, bronchitis, bursitis, carditis, cellulitis, cervicitis, cholecystitis, 
chorditis, cochlitis, colitis, conjunctivitis, cystitis, dermatitis, diverticulitis, encephalitis, 
endocarditis, esophagjtis, eustachitis, fibrositis, follicuhtis, gastritis, gastroenteritis, 
gingivitis, glossitis, hepatosplenitis, keratitis, labyrinthitis, laryngitis, lymphangitis, mastitis, 

1 5 media otitis, meningitis, metritis, mucitis, myocarditis, myosititis, myringitis, nephritis, 
neuritis, orchitis, osteochondritis, otitis, pericarditis, peritendonitis, peritonitis, pharyngitis, 
phlebitis, pohomyelitis, prostatitis, pulpitis, retinitis, rhinitis, salpingitis, scleritis, 
sclerochoroiditis, scrotitis, sinusitis, spondylitis, steatitis, stomatitis, synovitis, syringitis, 
tendonitis, tonsillitis, urethritis, and vaginitis; and disorders that are characterized by 

20 inflammation such as hepatitis, rheumatoid arthritis, gout, trauma, pancreatitis, sarcoidosis, 
dermatitis, renal ischemia-reperfusion injury, Grave's disease, systemic lupus erythematosus, 
diabetes mellitus, and allogenic transplant rejection. 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 
tissues of a subject would be of benefit in the treatment of other diseases, syndromes and 

25 conditions, such as adenosine deaminase deficiency, sickle cell deficiency, thalassemia, 

hemophilia, diabetes, phenylketonuria, growth disorders, and defects of the immune system. 

BAAV 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier of the lung, comprising delivering to the lung a BAAV vector comprising the nucleic 
30 acid. In one aspect of the method, the epithelial barrier comprises human bronchial, 

alveolar, tracheal or upper airway epithelial cells. Thus, disclosed is a method of delivering 
a heterologous nucleic acid across human airway epithelial cells, comprising delivering to 
the cells a BAAV vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
35 barrier in the brain, comprising delivering to the brain a BAAV vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human cerebral 
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5 microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 

barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells a BAAV vector 
comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across the epithelial 

1 0 barrier of blood vessels into the muscle, comprising delivering to the blood stream a BAAV 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human vascular endothelial cells. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the genitourinary tract, comprising delivering to the genitourinary tract a BAAV 

1 5 vector comprising the nucleic acid genitourinary tract. In one aspect of the method, the 

epithelial barrier comprises human endometrial or urinary epithelial cells. Thus, disclosed is 
a method of delivering a heterologous nucleic acid across human endometrial epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an -epithelial 

20 barrier in the kidney, comprising delivering to the genitourinary tract a BAAV vector 

comprising the nucleic acid genitourinary tract. In one aspect of the method, the epithelial 
barrier comprises human renal collecting ducts or proximal tubules. Thus, disclosed is a 
method of delivering a heterologous nucleic acid across human kidney epithelial cells, 
comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

25 Disclosed is a method of transcytosing lung epithelial cells of a subject comprising 

contacting the lung epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
bronchial, tracheal, or upper airway epithelial cells. 

Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 

30 contacting the CNS epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

Disclosed is a method of transcytosing vascular epithelial cells of a subject 
35 comprising contacting the vascular epithelial cells of the subject with a BAAV vector 
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comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
human vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the genitourinary tract epithelial cells of the subject with a BAAV 
vector comprising a heterologous nucleic acid. In one aspect of the method, the epithelial 
cells are human endometrial or urinary tract epithelial cells. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the kidney epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
human renal collecting ducts or proximal tubules 

AAV5 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the gut, comprising delivering to the gut an AAV5 vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human absorptive 
enterocytes or M cells. Thus, disclosed is a method of delivering a heterologous nucleic acid 
across human gut epithelial cells enterocytes, comprising delivering to the cells an AAV5 
vector comprising the nucleic acid. 

Disclosed is a method of transcytosing gut epithelial cells of a subject comprising 
contacting the gut epithelial cells of the subject with an AAV5 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
absorptive enterocytes. 
AAV4 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the gut, comprising delivering to the gut an AAV4 vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human absorptive 
enterocytes or M cells. Thus, disclosed is a method of delivering a heterologous nucleic acid 
across human gut epithelial ceUs enterocytes, comprising delivering to the cells an AAV4 
vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the lung, comprising delivering to the lung an AAV4 vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human bronchial, 
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5 tracheal, or upper airway epithelial cells. Thus, disclosed is a method of delivering a 

heterologous nucleic acid across human airway epithelial cells, comprising delivering to the 
cells an AAV4 vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the CNS, comprising delivering to the CNS an AAV4 vector comprising the 

10 nucleic acid. In one aspect of the method, the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 
barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells a AAV4 vector 
comprising the nucleic acid. 

1 5 Disclosed is a method of delivering a heterologous nucleic acid across the epithelial 

barrier of blood vessels into the muscle, comprising delivering to the blood stream an AAV4 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

20 barrier in the genitourinary tract, comprising delivering to the genitourinary tract an AAV4 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human endometrial or urinary epithelial cells. Thus, disclosed is a method of 
delivering a heterologous nucleic acid across human endometrial epithelial cells, comprising 
delivering to the cells an AAV4 vector comprising the nucleic acid. 

25 Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

barrier in the kidneys, comprising delivering to the kidneys an AAV4 vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. Thus, disclosed is a method of delivering a 
heterologous nucleic acid across human kidney epithelial cells, comprising delivering to the 

30 cells an AAV4 vector comprising the nucleic acid. 

Disclosed is a method of transcytosing lung epithelial cells of a subject comprising 
contacting the lung epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
bronchial, tracheal, or upper airway epithelial cells. 

35 Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 

contacting the CNS epithelial cells of the subject with an AAV4 vector comprising a 
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heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

Disclosed is a method of transcytosing vascular epithelial cells of a subject 
comprising contacting the vascular epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the genitourinary epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
human endometrial or urinary epithelial cells. 

Disclosed is a method of Iranscytosing kidney epithelial cells of a subject comprising 
contacting the kidney epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human renal 
collecting ducts or proximal tubules 

Disclosed is a method of transcytosing gut epithelial cells of a subject comprising 
contacting the gut epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
absorptive enterocytes. 
AAV7 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the CNS, comprising delivering to the CNS an AAV7 vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 
barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells an AAV7 vector 
comprising the nucleic acid. 

Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 
contacting the CNS epithelial cells of the subject with an AAV7 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 
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5 Inhibition of Transcytosis to Increase Transduction 

Described herein is a method for re-directing virus that enters a cell by transcytosis 
to result in transduction of the cell by blocking exocytosis. Thus, provided is a method of 
improving the efficiency of nucleic acid delivery to epithelial cells, comprising delivering to 
the cells an inhibitor of exocytosis and an AAV vector.containing the nucleic acid. Also 
10 provided is a method for transducing cells that have transcytosis activity but are normally 
resistant to transduction comprising administering to the cells inhibitors of exocytosis. 

In one aspect of the methods, the AAV vector is derived from AAV4, AAV5, or . 
BAAV. In a further aspect of the methods, the epithelial cell barriers are located in the 
kidney, gut, lung or vascular endothelium 
1 5 Thus, disclosed is a method of delivering a heterologous nucleic acid to human 

airway epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
kidney epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
20 an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
vascular endothelial cells, comprising delivering to the cells and an inhibitor of exocytosis 
and an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
25 airway epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
a BAAV vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
kidney epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
a BAAV vector comprising the nucleic acid. 
30 Further disclosed is a method of delivering a heterologous nucleic acid to human 

vascular endothelial cells, comprising delivering to the cells and an inhibitor of exocytosis 
and a BAAV vector comprising the nucleic acid 

Further disclosed is a method of delivering a heterologous nucleic acid to human gut 
epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and an 
35 AAV5 vector comprising the nucleic acid. 
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5 In one aspect of the disclosed methods, the inhibitors of exocytosis are chemical 

modifiers. In a further aspect of the methods, the chemical modifier is tannic acid, wherein 
the tannic acid is delivered to the basal lateral surface of the epithelial cells. 

Compositions and methods for making AAV4 vectors 

Compositions and methods for making and using AAV4 vectors have been 

10 previously described in U.S. Patent No. 6,468,524, which is hereby incorporated herein by 
reference for this teaching. 

Provided is the nucleotide sequence of the adeno-associated virus 4 (AAV4) genome 
and vectors and particles derived therefrom. Specifically, provided is a nucleic acid vector 
comprising a pair of AAV4 inverted terminal repeats (ITRs) and a promoter between the 

1 5 inverted terminal repeats. The AAV4 ITRs are exemplified by the nucleotide sequence set 
forth in SEQ ID NO:6 and SEQ ID NO:20; however, these sequences can have minor 
modifications and still be contemplated to constitute AAV4 ITRs. The nucleic acid listed in 
SEQ ID NO:6 depicts the ITR in the "flip" orientation of the ITR. The nucleic acid listed in 
SEQ ID NO:20 depicts the ITR in the "flop" orientation of the ITR. Minor modifications in 

20 an ITR of either orientation are those that will not interfere with the hairpin structure formed 
by the AAV4 ITR as described herein and known in the art. Furthermore, to be considered 
within the term "AAV4 ITRs" the nucleotide sequence must retain the Rep binding site 
described herein and exemplified in SEQ ID NO:6 and SEQ ID NO:20, i.e., it must retain 
one or both features described herein that distinguish the AAV4 ITR from the AAV2 ITR: 

25 (1) four (rather than three as in AAV2) "GAGC" repeats and (2) in the AAV4 ITR Rep 
binding site the fourth nucleotide in the first two "GAGC" repeats is a T rather than a C. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. Promoters can be an exogenous or an endogenous 

30 promoter. Promoters can include, for example, known strong promoters such as SV40 or the 
inducible metallothionein promoter, or an AAV promoter, such as an AAV p5 promoter. 
Additional examples of promoters include promoters derived from actin genes, 
immunoglobulin genes, cytomegalovirus (CMV), adenovirus, bovine papilloma virus, 
adenoviral promoters, such as the adenoviral major late promoter, an inducible heat shock 

35 promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), etc. Specifically, the 
promoter can be AAV2 p5 promoter or AAV4 p5 promoter. More specifically, the AAV4 
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5 p5 promoter can be about nucleotides 1 30 to 291 of SEQ ID NO: L Additionally, the p5 
promoter may be enhanced by nucleotides 1-130. Furthermore, smaller fragments of p5 
promoter that retain promoter activity can readily be determined by standard procedures 
including, for example, constructing a series of deletions in the p5 promoter, linking the 
deletion to a reporter gene, and determining whether the reporter gene is expressed, i.e., 

1 0 transcribed and/or translated. 

The present invention also contemplates any unique fragment of these AAV4 nucleic 
acids, including the AAV4 nucleic acids set forth in SEQ ID NOs: 1, 3, 5, 6, 7, 12-15, 17 
and 19. Fragments can be, for example, at least about 30, 40, 50, 75, 100, 200 or 500 
nucleotides in length. The nucleic acid can be single or double stranded, depending upon the 

15 purpose for which it is intended. 

The present invention further provides an AAV4 Capsid polypeptide or a unique 
fragment thereof. AAV4 capsid polypeptide is encoded by ORF 2 of AAV4. Specifically, 
provided is an AAV4 Capsid protein comprising the amino acid sequence encoded by 
nucleotides 2260-4464 of the nucleotide sequence set forth in SEQ ID NO:l, or a unique 

20 fragment of such protein. The present invention also provides an AAV4 Capsid protein 

consisting essentially of the amino acid sequence encoded by nucleotides 2260-4464 of the 
nucleotide sequence set forth in SEQ ID NO: 1, or a unique fragment of such protein. The 
present invention further provides the individual AAV4 coat proteins, VP1, VP2 and VP3. 
Thus, provided is an isolated polypeptide having the amino acid sequence set forth in SEQ 

25 ID NO:4 (VP1). The present invention additionally provides an isolated polypeptide having 
the amino acid sequence set forth in SEQ ID NO:16 (VP2). The present invention also 
provides an isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 18 
(VP3) By "unique fragment thereof is meant any smaller polypeptide fragment encoded by 
any AAV4 capsid gene that is of sufficient length to be unique to the AAV4 Capsid protein. 

30 Substitutions and modifications of the amino acid sequence can be made as described above 
and, further, can include protein processing modifications, such as glycosylation, to the 
polypeptide. However, an AAV4 Capsid polypeptide including all three coat proteins will 
have at least about 63% overall homology to the polypeptide encoded by nucleotides 2260- 
4464 of the sequence set forth in SEQ ID NO: 1. The protein can have about 65%, about 

35 70%, about 75%, about 80%, about 85%, about 90%, about 95% or even 100% homology to 
the amino acid sequence encoded by the nucleotides 4467of the sequence set forth in SEQ 
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5 ID NO: 1 . An AAV4 VP2 polypeptide can have at least about 58%, about 60%, about 70%, 
about 80%, about 90% about 95% or about 100% homology to the amino acid sequence set 
forth in SEQ ID NO: 16. An AAV4 VP3 polypeptide can have at least about 60%, about 
70%, about 80%, about 90% about 95% or about 100% homology to the amino acid 
sequence set forth in SEQ ID NO: 18. 

10 The herein described AAV4 nucleic acid vector can be encapsidated in an AAV 

particle. In particular, it can be encapsidated in an AAV1 particle, an AAV2 particle, an 
AAV3 particle, an AAV4 particle, or an AAV5 particle by standard methods using the 
appropriate capsid proteins in the encapsidation process, as long as the nucleic acid vector 
fits within the size limitation of the particle utilized. The encapsidation process itself is 

15 standard in the art. 

An AAV4 particle is a viral particle comprising an AAV4 capsid protein. An AAV4 
capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide can overall have at 
least about 63% homology to the polypeptide having the amino acid sequence encoded by 
nucleotides 2260-4464 set forth in SEQ ID NO:l (AAV4 capsid protein). The capsid protein 

20 can have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by nucleotides 2260-4464 set forth in SEQ 
ID NO:l. The particle can be a particle comprising both AAV4 and AAV2 capsid protein, 
i.e., a chimeric protein. Variations in the amino acid sequence of the AAV4 capsid protein 

25 are contemplated herein, as long as the resulting viral particle comprising the AAV4 capsid 
remains antigenically or immunologically distinct from AAV2, as can be routinely 
determined by standard methods. Specifically, for example, ELISA and Western blots can 
be used to determine whether a viral particle is antigenically or immunologically distinct 
from AAV2. Furthermore, the AAV4 viral particle preferably retains tissue tropism 

30 distinction from AAV2, such as that exemplified in the examples herein, though an AAV4 
chimeric particle comprising at least one AAV4 coat protein may have a different tissue 
tropism from that of an AAV4 particle consisting only of AAV4 coat proteins. 

An AAV4 particle is a viral particle comprising an AAV4 capsid protein. An AAV4 
capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide can overall have at 

35 least about 63% homology to the polypeptide having the amino acid sequence encoded by 
nucleotides 2260-4467 set forth in SEQ ID NO:l (AAV4 capsid protein). The capsid protein 
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5 can have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by nucleotides 2260-4467 set forth in SEQ 
ID NO:l. The particle can comprise only VP1 and VP3 and still stably transduce cells. 
The particle can be a particle comprising both AAV4 and AAV2 capsid protein, i.e., a 

1 0 chimeric protein. Variations in the amino acid sequence of the AAV4 capsid protein are 
contemplated herein, as long as the resulting viral particle comprising the AAV4 capsid 
remains antigenically or immunologically distinct from AAV2, as can be routinely 
determined by standard methods. Specifically, for example, ELISA and Western blots can 
be used to determine whether a viral particle is antigenically or immunologically distinct 

1 5 from AAV2. Furthermore, the AAV4 viral particle preferably retains tissue tropism 

distinction from AAV2, such as that exemplified in the examples herein, though an AAV4 
chimeric particle comprising at least one AAV4 coat protein may have a different tissue 
tropism from that of an AAV4 particle consisting only of AAV4 coat proteins. 

The invention further provides an AAV4 particle containing, i.e. , encapsidating, a 

20 vector comprising a pair of AAV2 inverted terminal repeats. The nucleotide sequence of 
AAV2 ITRs is known in the art. Furthermore, the particle can be a particle comprising both 
AAV4 and AAV2 capsid protein, i.e., a chimeric protein. The vector encapsidated in the 
particle can further comprise an exogenous nucleic acid inserted between the inverted 
terminal repeats. 

25 The present invention further provides an isolated nucleic acid comprising the 

nucleotide sequence set forth in SEQ ID NO:l (AAV4 genome). This nucleic acid, or 
portions thereof, can be inserted into other vectors, such as plasmids, yeast artificial 
chromosomes, or other viral vectors, if desired, by standard cloning methods. The present 
invention also provides an isolated nucleic acid consisting essentially of the nucleotide 

30 sequence set forth in SEQ ID NO:l. The nucleotides of SEQ ID NO:l can have minor 
modifications and still be contemplated by the present invention. For example, 
modifications that do not alter the amino acid encoded by any given codon (such as by 
modification of the third, "wobble," position in a codon) can readily be made, and such 
alterations are known in the art. Furthermore, modifications that cause a resulting neutral 

35 amino acid substitution of a similar amino acid can be made in a coding region of the 

genome. Additionally, modifications as described herein for the AAV4 components, such as 
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5 the HRs, the p5 promoter, etc. are contemplated in this invention. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with an isolated nucleic acid consisting essentially of the nucleotide sequence set 
forth in SEQ ID NO: 1 (AAV4 genome). The present invention further provides an isolated 
nucleic, acid that selectively hybridizes with an isolated nucleic acid comprising the 

10 nucleotide sequence set forth in SEQ ID NO: 1 (AAV4 genome). By "selectively hybridizes" 
as used in the claims is meant a nucleic acid that specifically hybridizes to the particular 
target nucleic acid under sufficient stringency conditions to selectively hybridize to the 
target nucleic acid without significant background hybridization to a nucleic acid encoding 
an unrelated protein, and particularly, without detectably hybridizing to AAV2. Thus, a 

1 5 nucleic acid that selectively hybridizes with a nucleic acid of the present invention will not 
selectively hybridize under stringent conditions with a nucleic acid encoding a different 
protein, and vice versa. Therefore, nucleic acids for use, for example, as primers and probes 
to detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 
that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 

20 probes for further hybridization or for amplification methods (e.g. , polymerase chain 

reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both AAV4 and a gene of interest carried 
within the AAV4 vector (i. e. , a chimeric nucleic acid). 

The present invention further provides an isolated nucleic acid encoding an adeno- 

25 associated virus 4 Rep protein. The AAV4 Rep proteins are encoded by open reading frame 
(ORF) 1 of the AAV4 genome. The AAV4 Rep genes are exemplified by the nucleic acid 
set forth in SEQ ID NO:3 (AAV4 ORF1), and include a nucleic acid consisting essentially 
of the nucleotide sequence set forth in SEQ ED NO:3 and a nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO:3. The present invention also includes a nucleic 

30 acid encoding the amino acid sequence set forth in SEQ ID NO: 2 (polypeptide encoded by 
AAV4 ORF1). However, the present invention includes that the Rep genes nucleic acid can 
include any one, two, three, or four of the four Rep proteins, in any order, in such a nucleic 
acid. Furthermore, minor modifications are contemplated in the nucleic acid, such as silent 
mutations in the coding sequences, mutations that make neutral or conservative changes in 

35 the encoded amino acid sequence, and mutations in regulatory regions that do not disrupt 
the expression of the gene. Examples of other minor modifications are known in the art. 
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5 Further modifications can be made in the nucleic acid, such as to disrupt or alter expression 
of one or more of the Rep proteins in order to, for example, determine the effect of such a 
disruption; such as to mutate one or more of the Rep proteins to determine the resulting 
effect, etc. However, in general, a modified nucleic acid encoding all four Rep proteins will 
have at least about 90%, about 93%, about 95%, about 98% or 100% homology to the 

10 sequence set forth in SEQ ID NO:3, and the Rep polypeptide encoded therein will have 
overall about 93%, about 95%, about 98%, about 99% or 100% homology with the amino 
acid sequence set forth in SEQ ID NO:2. 

The present invention also provides an isolated nucleic acid that selectively 
hybridizes with a nucleic acid consisting essentially of the nucleotide sequence set forth in 

15 SEQ ID NO:3 and an isolated nucleic acid that selectively hybridizes with a nucleic acid 
comprising the nucleotide sequence set forth in SEQ ID NO:3. "Selectively hybridizing" is 
defined elsewhere herein. 

The present invention also provides each individual AAV4 Rep protein and the 
nucleic acid encoding each. Thus provided is the nucleic acid encoding a Rep 40 protein, 

20 and in particular an isolated nucleic acid comprising the nucleotide sequence set forth in 

SEQ ID NO: 12, an isolated nucleic acid consisting essentially of the nucleotide sequence set 
forth in SEQ ID NO: 12, and a nucleic acid encoding the adeno-associated virus 4 Rep 
protein having the amino acid sequence set forth in SEQ ID NO:8. The present invention- 
also provides the nucleic acid encoding a Rep 52 protein, and in particular an isolated 

25 nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 13, an isolated 
nucleic acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 13, 
and a nucleic acid encoding the adeno-associated virus 4 Rep protein having the amino acid 
sequence set forth in SEQ ID NO:9. The present invention further provides the nucleic acid 
encoding a Rep 68 protein, and in particular an isolated nucleic acid comprising the 

30 nucleotide sequence set forth in SEQ ID NO:14, an isolated nucleic acid consisting 
essentially of the nucleotide sequence set forth in SEQ ID NO:14, and a nucleic acid 
encoding the adeno-associated virus 4 Rep protein having the amino acid sequence set forth 
in SEQ ID NO: 10. And, further, provided is the nucleic acid encoding a Rep 78 protein, and 
in particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 

35 NO:15, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 15, and a nucleic acid encoding the adeno-associated virus 4 Rep protein 
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5 having the amino acid sequence set forth in SEQ ID NO: 1 1 . As described elsewhere herein, 
these nucleic acids can have minor modifications, including silent nucleotide substitutions, 
mutations causing neutral amino acid substitutions in the encoded proteins, and mutations in 
control regions that do not or minimally affect the encoded amino acid sequence. 

The present invention further provides a nucleic acid encoding the entire AAV4 

1 0 Capsid polypeptide. Specifically, provided is a nucleic acid having the nucleotide sequence 
set for the nucleotides 2260-4467 of SEQ ID NO:l. Furthermore, provided is a nucleic acid 
encoding each of the three AAV4 coat proteins, VP1, VP2, and VP3. Thus, provided is a 
nucleic acid encoding AAV4 VP1 , a nucleic acid encoding AAV4 VP2, and a nucleic acid 
encoding AAV4 VP3. Thus, provided is a nucleic acid encoding the amino acid sequence 

15 set forth in SEQ ID NO:4 (VP 1); a nucleic acid encoding the amino acid sequence set forth 
in SEQ ID NO: 16 (VP2), and a nucleic acid encoding the amino acid sequence set forth in 
SEQ ID NO:18 (VP3). The present invention also specifically provides a nucleic acid 
comprising SEQ ID NO:5 (VP1 gene); a nucleic acid comprising SEQ ED NO: 17 (VP2 
gene); and a nucleic acid comprising SEQ ID NO:19 (VP3 gene). The present invention also 

20 specifically provides a nucleic acid consisting essentially of SEQ ID NO:5 (VP 1 gene), a 
nucleic acid consisting essentially of SEQ ID NO: 17 (VP2 gene), and a nucleic acid 
consisting essentially of SEQ ID NO: 19 (VP3 gene). Furthermore, a nucleic acid encoding 
an AAV4 capsid protein VP 1 is set forth as nucleotides 2260-4467 of SEQ ID NO:l; a 
nucleic acid encoding an AAV4 capsid protein VP2 is set forth as nucleotides 2668-4467 of 

25 SEQ ID NO: 1 ; and a nucleic acid encoding an AAV4 capsid protein VP3 is set forth as 
nucleotides 2848-4467 of SEQ ID NO: 1 . Minor modifications in the nucleotide sequences 
encoding the capsid, or coat, proteins are contemplated, as described above for other AAV4 
nucleic acids. 

Provided is an isolated AAV4 Rep protein. AAV4 Rep polypeptide is encoded by 
30 ORF1 of AAV4. Specifically, provided is an AAV4 Rep polypeptide comprising the amino 
acid sequence set forth in SEQ ID NO:2, or a unique fragment thereof. The present 
invention also provides an AAV4 Rep polypeptide consisting essentially of the amino acid 
sequence set forth in SEQ ID NO:2, or a unique fragment thereof. Additionally, nucleotides 
291-2306 of the AAV4 genome, which genome is set forth in SEQ ID NO:l, encode the 
35 AAV4 Rep polypeptide. The present invention also provides each AAV4 Rep protein. Thus 
provided is AAV4 Rep 40, or a unique fragment thereof. The present invention particularly 
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5 provides Rep 40 having the amino acid sequence set forth in SEQ ID NO:8. Provided is 
AAV4 Rep 52, or a unique fragment thereof. The present invention particularly provides 
Rep 52 having the amino acid sequence set forth in SEQ ID NO:9. Provided is AAV4 Rep 
68, or a unique fragment thereof. The present invention particularly provides Rep 68 having, 
the amino acid sequence set forth in SEQ ID NO: 1 0. Provided is AAV4 Rep 78, or a unique 

10 fragment thereof. The present invention particularly provides Rep 78 having the amino acid 
sequence set forth in SEQ ID NO: 11. By "unique fragment thereof is meant any smaller 
polypeptide fragment encoded by AAV rep gene that is of sufficient length to be unique to 
the Rep polypeptide. Substitutions and modifications of the amino acid sequence can be 
made as described above and, further, can include protein processing modifications, such as 

1 5 glycosylation, to the polypeptide. However, a polypeptide including all four Rep proteins 
will encode a polypeptide having at least about 91% overall homology to the sequence set 
forth in SEQ ID NO:2, and it can have about 93%, about 95%, about 98%, about 99% or 
100% homology with the amino acid sequence set forth in SEQ ID NO:2. 

The present invention further provides an AAV4 Capsid polypeptide or a unique 

20 fragment thereof. AAV4 capsid polypeptide is encoded by ORF 2 of AAV4. Specifically, 
provided is an AAV4 Capsid protein comprising the amino acid sequence encoded by 
nucleotides 2260-4467.of the nucleotide sequence set forth in SEQ ID NO:l, or a unique 
fragment of such protein. The present invention also provides an AAV4 Capsid protein 
consisting essentially of the amino acid sequence encoded by nucleotides 2260-4467 of the 

25 nucleotide sequence set forth in SEQ ID NO: 1 , or a unique fragment of such protein. The 
present invention further provides the individual AAV4 coat proteins, VP1, VP2 and VP3. 
Thus, provided is an isolated polypeptide having the amino acid sequence set forth in SEQ 
ID NO:4 (VP1). The present invention additionally provides an isolated polypeptide having 
the amino acid sequence set forth in SEQ ID NO: 1 6 (VP2). The present invention also 

30 provides an isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 1 8 
(VP3). By "unique fragment thereof is meant any smaller polypeptide fragment encoded by 
any AAV4 capsid gene that is of sufficient length to be unique to the AAV4 Capsid protein. 
Substitutions and modifications of the amino acid sequence can be made as described above 
and, further, can include protein processing modifications, such as glycosylation, to the 

35 polypeptide. However, an AAV4 Capsid polypeptide including all three coat proteins will 
have at least about 63% overall homology to the polypeptide encoded by nucleotides 2260- 
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5 4467 of the sequence set forth in SEQ ID NO: 1. The protein can have about 65%, about 
70%, about 75%, about 80%, about 85%, about 90%, about 95% or even 100% homology to 
the amino acid sequence encoded by the nucleotides 2260-4467 of the sequence set forth in 
SEQ ED NO:4. An AAV4 VP2 polypeptide can have at least about 58%, about 60%, about 
70%, about 80%, about 90% about 95% or about 100% homology to the amino acid 

10 sequence set forth in SEQ ID NO:16. An AAV4 VP3 polypeptide can have at least about 
60%, about 70%, about 80%, about 90% about 95% or about 100% homology to the amino 
acid sequence set forth in SEQ ID NO: 18. 

The AAV inverted terminal repeats in the vector for the herein described delivery 
methods can be AAV4 inverted terminal repeats. Specifically, they can comprise the nucleic 

15 acid whose nucleotide sequence is set forth in SEQ ID NO:6 or the nucleic acid whose 

nucleotide sequence is set forth in SEQ ID NO:20, or any fragment thereof demonstrated to 
have ITR functioning. The ITRs can also consist essentially of the nucleic acid whose 
nucleotide sequence is set forth in SEQ ID NO:6 or the nucleic acid whose nucleotide 
sequence is set forth in SEQ ID NO:20. Furthermore, the AAV inverted terminal repeats in 

20 the vector for the herein described nucleic acid delivery methods can also comprise AAV2 
inverted terminal repeats. Additionally, the AAV inverted terminal repeats in the vector for 
this delivery method can also consist essentially of AAV2 inverted terminal repeats. 

Compositions and methods for making AAV5 vectors 

Compositions and methods for making and using AAV5 vectors have been 
25 previously described in U.S. Patent Application No. 09/717,789, filed November 21, 2000, 
and in U.S. Patent No. 6,855,314, which are hereby incorporated herein by reference for this 
teaching. 

The present application provides a recombinant adeno-associated virus 5 (AAV5). 
This virus has one or more of the characteristics described below. The compositions of the 
30 present invention do not include wild-type AAV5. The methods of the present invention can 
use either wild-type AAV5 or recombinant AAV5-based delivery. 

Provided are novel AAV5 particles, recombinant AAV5 vectors, recombinant AAV5 
virions and novel AAV5 nucleic acids and polypeptides. An AAV5 particle is a viral 
particle comprising an AAV5 capsid protein. A recombinant AAV5 vector is a nucleic acid 
35 construct that comprises at least one unique nucleic acid of AAV5. A recombinant AAV5 
virion is a particle containing a recombinant AAV5 vector, wherin the particle can be either 
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5 an AAV5 particle as described herein or a non-AAV5 particle. Alternatively, the 

recombinant AAV5 virion is an AAV5 particle containing a recombinant vector, wherein 
the vector can be either an AAV5 vector as described herein or a non-AAV5 vector. These 
vectors, particles, virions, nucleic acids and polypeptides are described below. 

Provided is the nucleotide sequence of the AAV5 genome and vectors and particles 

1 0 derived therefrom. Specifically, provided is a nucleic acid vector comprising a pair of 
AAV5 inverted terminal repeats (ITRs) and a promoter between the inverted terminal 
repeats. While the rep proteins of AAV2 and AAV5 will bind to either a type 2 ITR or a 
type 5 ITR, efficient genome replication only occurs when type 2 Rep replicates a type 2 
ITR and a type 5 Rep replicates a type 5 ITR. This specificity is the result of a difference in 

15 DNA cleavage specificity of the two Reps which is necessary for replication. AAV5 Rep 
cleaves at CGGT A GTGA (SEQ ID NO: 43) and AAV2 Rep cleaves at CGGT A TGAG (SEQ 
ID NO: 44) (Chiorini et al., 1999. J. Virol. 73 (5) 4293-4298). Mapping of the AAV5 ITR 
terminal resolution site (TRS) identified this distinct cleavage site, CGGT A GTGA, which is 
absent from the ITRs of other AAV serotypes. Therefore, the minimum sequence necessary 

20 to distinguish AAV5 from AAV2 is the TRS site where Rep cleaves in order to replicate the 
virus. Examples of the type 5 ITRs are shown in SEQ ID NO: 41 and SEQ ID NO: 42, 
AAV5 ITR "flip" and AAV5 "flop", respectively. Minor modifications in an ITR of either 
orientation are contemplated and are those that will not interfere with the hairpin structure 
formed by the AAV5 ITR as described herein. Furthermore, to be considered within the 

25 term "AAV5 ITR" the nucleotide sequence must retain one or more features described 
herein that distinguish the AAV5 ITR from the rTRs of other serotypes, e.g. it must retain 
the Rep binding site described herein. 

The D- region of the AAV5 ITR (SEQ ID NO: 45), a single stranded region of the 
ITR, inboard of the TRS site, has been shown to bind a factor which depending on its 

30 phosphorylation state correlates with the conversion of the AAV from a single stranded 
genome to a transcriptionally active form that allows for expression of the viral DNA. This 
region is conserved between AAV2, 3, 4,and 6 but is divergent in AAV5. The D+ region is 
the reverse complement of the D- region. 

The promoter can be any desired promoter, selected by known considerations, such 

35 as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. That is, the promoter can be tissue/cell-specific. 
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5 Promoters can be prokaryotic, eukaryotic, fungal, nuclear, mitochondrial, viral or plant 
promoters. Promoters can be exogenous or endogenous to the cell type being transduced by 
the vector. Promoters can include, for example, bacterial promoters, known strong 
promoters such as SV40 or the inducible metallotbionem promoter, or an AAV promoter, 
such as an AAV p5 promoter. Additionally, chimeric regulatory promoters for targeted gene 

10 expression can be utilized. Examples of these regulatory systems, which are known in the 
art, include the tetracycline based regulatory system which utilizes the tet transactivator 
protein (tTA), a chimeric protein containing the VP16 activation domain fused to the tet 
repressor of Escherichia coli, the IPTG based regulatory system, the CED based regulatory 
system, and the Ecdysone based regulatory system. Other promoters include promoters 

1 5 derived from actin genes, immunoglobulin genes, cytomegalovirus (CMV), adenovirus, 
bovine papilloma virus, adenoviral promoters, such as the adenoviral major late promoter, 
an inducible heat shock promoter, respiratory syncytial virus, Rous sarcomas virus (RS V), 
etc., specifically, the promoter can be AAV2 p5 promoter or AAV5 p5 promoter. More 
specifically, the AAV5 p5 promoter can be about same location in SEQ ED NO: 23 as the 

20 AAV2 p5 promoter, in the corresponding AAV2 published sequence. An example of an 
AAV5 p5 promoter is nucleotides 220-338 of SEQ ID NO: 23. Additionally, the p5 
promoter may be enhanced by nucleotides 1-130 of SEQ ID NO: 23. Furthermore, smaller 
fragments of p5 promoter that retain promoter activity can readily be determined by standard 
procedures including, for example, constructing a series of deletions in the p5 promoter, 

25 linking the deletion to a reporter gene, and determining whether the reporter gene is 

expressed, i.e., transcribed and/or translated. The promoter can be the promoter of any of the 
AAV serotypes, and can be the pl9 promoter (SEQ ID NO: 38) or the p40 promoter set 
forth in the sequence listing as SEQ ID NO: 39. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 

30 herein can be corrected, for example, by using the hybridization procedure described below 
with various probes derived from the described sequences such that the coding sequence can 
be reisolated and resequenced. Rapid screening for point mutations can also be achieved 
with the use of polymerase chain reaction single strand conformation polymorphism (PCR 
SSCP). The corresponding amino acid sequence can then be corrected accordingly. 

3 5 The AAV5-derived vector can include any normally occurring AAV5 sequences in 

addition to an ITR and promoter. Examples of vector constructs are provided below. 
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5 The present vector or AAV5 particle or recombinant AAV5 virion can utilize any 

unique fragment of the present AAV5 nucleic acids, including the AAV5 nucleic acids set 
forth in SEQ ID NOS: 23 and 29-33, 35, 37, 38, 39 and 40. To be unique, the fragment must 
be of sufficient size to distinguish it from other known sequences, most readily determined 
by comparing any nucleic acid fragment to the nucleotide sequences of nucleic acids in 

10 computer databases, such as GenBank. Such comparative searches are standard in the art. 
Typically, a unique fragment useful as a primer or probe will be at least about 8 or 10, 
preferable at least 20 or 25 nucleotides in length, depending upon the specific nucleotide 
content of the sequence. Additionally, fragments can be, for example, at least about 30, 40, 
50, 75, 100, 200 or 500 nucleotides in length and can encode polypeptides or be probes. The 

1 5 nucleic acid can be single or double stranded, depending upon the purpose for which it is 
intended. Where desired, the nucleic acid can be RNA. 

The present invention further provides an isolated AAV5 capsid protein to contain 
the vector. In particular, provided is not only a polypeptide comprising all three AAV5 coat 
proteins, i.e., VP1, VP2 and VP3, but also a polypeptide comprising each AAV5 coat 

20 protein individually, SEQ ID NOS: 26, 27, and 28, respectively. Thus an AAV5 particle 
comprising an AAV5 capsid protein comprises at least one AAV5 coat protein VP1, VP2 or 
VP3. An AAV5 particle comprising an AAV5 capsid protein can be utilized to deliver a 
nucleic acid vector to a ceil, tissue or subject. For example, the herein described AAV5 
vectors can be encapsidated in an AAV5 capsid-derived particle and utilized in a gene 

25 delivery method. Furthermore, other viral nucleic acids can be encapsidated in the AAV5 
particle and utilized in such delivery methods. For example, an AAV1, 2,3,4,or 6 vector 
(e.g. AAVl,2,3,4,or 6 ITR and nucleic acid of interest )can be encapsidated in an AAV5 
particle and administered. Furthermore, an AAV5 chimeric capsid incorporating both AAV2 
capsid and AAV5 capsid sequences can be generated, by standard cloning methods, 

30 selecting regions from the known sequences of each protein as desired. For example, 
particularly antigenic regions of the AAV2 capsid protein can be replaced with the 
corresponding region of the AAV5 capsid protein. In addition to chimeric capsids 
incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3, 4, or 6 and 
AAV5 capsid sequences can be generated, by standard cloning methods, selecting regions 

35 from the known sequences of each protein as desired. The particle can also comprise only 
VP1 and VP3 capsid proteins. 
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5 The capsids can also be modified to alter their specific tropism by genetically 

altering the capsid to encode a specific ligand to a cell surface receptor. Alternatively, the 
capsid can be chemically modified by conjugating a ligand to a cell surface receptor. By 
genetically or chemically altering the capsids, the tropism can be modified to direct AAV5 
to a particular cell or population of cells. The capsids can also be altered immunologically 

10 by conjugating the capsid to an antibody that recognizes a specific protein on the target cell 
or population of cells. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 

1 5 AAV5 particle comprising an AAV5 capsid protein. 

The herein described recombinant AAV5 nucleic acid derived vector can be 
encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle or an AAV6 
particle, a portion of any of these capsids, or a chimeric capsid particle as described above, 

20 by standard methods using the appropriate capsid proteins in the encapsidation process, as 
long as the nucleic acid vector fits within the size limitation of the particle utilized. The 
encapsidation process itself is standard in the art. The AAV5 replication machinery, i.e. the 
rep initiator proteins and other functions required for replication, can be utilized to produce 
the AAV5 genome that can be packaged in an AAV1, 2, 3, 4, 5 or 6 capsid. 

25 The recombinant AAV5 virion containing a vector can also be produced by 

recombinant methods utilizing multiple plasmids. In one example, the AAV5 rep nucleic 
acid would be cloned into one plasmid, the AAV5 ITR nucleic acid would be cloned into 
another plasmid and the AAV1, 2, 3, 4, 5 or 6 capsid nucleic acid would be cloned on 
another plasmid. These plasmids would then be introduced into cells. The cells that were 

30 efficiently transduced by all three plasmids, would exhibit specific integration as well as the 
ability to produce recombinant AAV5 virion. Additionally, two plasmids could be used 
where the AAV5 rep nucleic acid would be cloned into one plasmid and the AAV5 ITR and 
AAV5 capsid would be cloned into another plasmid. These plasmids would then be 
introduced into cells. The cells that were efficiently transduced by both plasmids, would 

35 exhibit specific integration as well as the ability to produce recombinant AAV5 virion. 

32 



WO 2006/029196 PCT/US2005/031837 

5 An AAV5 capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide 

can have greater than 56% overall homology to the polypeptide having the amino acid 
sequence encoded by nucleotides in SEQ ID NOS: 29, 30, 31, as shown in figures 4 and 5. 
The capsid protein can have about 70% homology, about 75% homology, 80% homology, 
85% homology, 90% homology, 95% homology, 98% homology, 99% homology, or even 

10 100% homology to the protein having the amino acid sequence encoded by the nucleotides 
set forth in SEQ ID NOS: 29, 30, or 31 . The percent homology used to identify proteins 
herein, can be based on a nucleotide-by-nucleotide comparison or more preferable is based 
on a computerized algorithm as described herein. Variations in the amino acid sequence of 
the AAV5 capsid protein are contemplated herein, as long as the resulting particle 

1 5 comprising an AAV5 capsid protein remains antigenically or immunologically distinct from 
AAV1, AAV2, AAV3, AAV4 or AAV6 capsid, as can be routinely determined by standard 
methods. Specifically, for example, ELISA and Western blots can be used to determine 
whether a viral particle is antigenically or immunologically distinct from AAV2 or the other 
serotypes. Furthermore, the AAV5 particle preferably retains tissue tropism distinction 

20 from AAV2, such as that exemplified in the examples herein. An AAV5 chimeric particle 
comprising at least one AAV5 coat protein may have a different tissue tropism from that of 
an AAV5 particle consisting only of AAV5 coat proteins, but is still distinct from the 
tropism of an AAV2 particle, in that it will infect some cells not infected by AAV2 or an 
AAV2 particle. 

25 The invention further provides a recombinant AAV5 virion, comprising an AAV5 

particle containing, i.e., encapsidating, a vector comprising a pair of AAV5 inverted 
terminal repeats. The recombinant vector can further comprise an AAV5 Rep-encoding 
nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. AAV5 Rep confers targeted 

30 integration and efficient replication, thus production of recombinant AAV5, comprising 
AAV5 Rep, yields more particles than production of recombinant AAV2. Since AAV5 is 
more efficient at replicating and packaging its genome, the exogenous nucleic acid inserted, 
or in the AAV5 capsids of the present invention, between the inverted terminal repeats can 
be packaged in the AAV1, 2, 3, 4, or 6 capsids to achieve the specific tissue tropism 

35 conferred by the capsid proteins. 
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5 The invention further contemplates chimeric recombinant ITRs that contains a rep 

binding site and a TRS site recognized by that Rep protein. By "Rep protein" is meant all 
four of the Rep proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" 
could be one or more of the Rep proteins described herein. One example of a chimeric ITR 
would consist of an AAV5 D region (SEQ ID NO: 45), an AAV5 TRS site (SEQ ID NO: 
10 43), an AAV2 hairpin and an AAV2 binding site. Another example would be an AAV5 D 
region, an AAV5 TRS site, an AAV3 hairpin and an AAV3 binding site. In these chimeric 
ITRs, the D region can be from AAV1, 2, 3, 4, 5 or 6. The hairpin can be derived from AAV 
1,2 3, 4, 5, 6. The binding site can be derived from any of AAV1, 2, 3, 4, 5 or 6. Preferably, 
the D region and the TRS are from the same serotype. 
1 5 The chimeric ITRs can be combined with AAV5 Rep protein and any of the AAV 

serotype capsids to obtain recombinant virion. For example, recombinant virion can be 
produced by an AAV5 D region, an AAV5 TRS site, an AAV2 hairpin, an AAV2 binding 
site, AAV5 Rep protein and AAV1 capsid. This recombinant virion would possess the 
cellular tropism conferred by the AAV1 capsid protein and would possess the efficient 
20 replication conferred by the AAV5 Rep. 

Other examples of the ITR, Rep protein and Capsids that will produce recombinant 
virion are provided in the list below: 
5ITR + 5Rep + 5Cap=virion 
5ITR + 5Rep + lCap=virion 
25 5ITR + 5Rep + 2Cap=virion 

5ITR + 5Rep + 3Cap=virion 
5ITR + 5Rep + 4Cap=virion 
5ITR + 5Rep + 6Cap=virion 
1ITR + IRep + 5Cap=virion 
30 2ITR + 2Rep + 5Cap=virion 

3ITR + 3Rep + 5Cap=virion 
4ITR + 4Rep + 5Cap=virion 
6ITR + 6Rep + 5Cap==virion 

In any of the constructs described herein, inclusion of a promoter is preferred. As 
35 used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 

AAV5 VP1, AAV5 VP2, AAV5 VP3, combinations thereof, functional fragments of any of 
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VP1, VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
described herein, can be chimeric recombinant lTRs as described elsewhere in the 
application. 

Conjugates of recombinant or wild-type AAV5 virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified AAV5 can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 
molecule. Also contemplated are virosomes that contain AAV5 structural proteins (AAV5 
capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize theAAV5 capsid or a 
unique region of the AAV5 capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 
to introduce DNA into cells. For example, the type 5 VP3 protein or fragment thereof, can 
be conjugated to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected 
using the targeting ability of the VP3 capsid protein to achieve the desired tissue tropism, 
specific to AAV5. Type 5 VP1 and VP2 proteins can also be utilized to introduce DNA or 
other molecules into cells. By further incorporating the Rep protein and the AAV TRS into 
the DNA-containing conjugate, cells can be transduced and targeted integration can be 
achieved. For example, if AAV5 specific targeted integration is desired, a conjugate 
composed of the AAV5 VP3 capsid, AAV5 rep or a fragment of AAV5 rep, AAV5 TRS, 
the rep binding site, the heterologous DNA of interest, and a lipid, can be utilized to achieve 
AAV5 specific tropism and AAV5 specific targeted integration in the genome. 

Further provided by this invention are chimeric viruses where AAV5 can be 
combined with herpes virus, heroes virus amplicons, baculovirus or other viruses to achieve 
a desired tropism associated with another virus. For example, the AAV5 ITRs could be 
inserted in the herpes virus and cells could be infected. Post-infection, the ITRs of AAV5 
could be acted on by AAV5 rep provided in the system or in a separate vehicle to rescue 
AAV5 from the genome. Therefore, the cellular tropism of the herpes simplex virus can be 
combined with AAV5 rep mediated targeted integration. Other viruses that could be utilized 
to construct chimeric viruses include, lentivirus, retrovirus, pseudotyped retroviral vectors, 
and adenoviral vectors. 
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5 The present invention further provides isolated nucleic acids of AAV5. For example, 

provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 23 (AAV5 genome). This nucleic acid, or portions thereof, can be inserted into vectors, 
such as plasmids, yeast artificial chromosomes, or other viral vector (particle), if desired, by 
standard cloning methods. The present invention also provides an isolated nucleic acid 

10 consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 23. The 

nucleotides of SEQ ID NO: 23 can have minor modifications and still be contemplated by 
the present invention. For example, modifications that do not alter the amino acid encoded 
by any given codon (such as by modification of the third, "wobble," position in a codon) can 
readily be made, and such alterations are known in the art. Furthermore, modifications that 

1 5 cause a resulting neutral (conserved) amino acid substitution of a similar amino acid can be 
made in a coding region of the genome. Additionally, modifications as described herein for 
the AAV5 components, such as the ITRs, the p5 promoter, etc. are contemplated in this 
invention. Furthermore, modifications to regions of SEQ ID NO: 23 other than in the ITR, 
TRS Rep binding site and hairpin are likely to be tolerated without serious impact on the 

20 function of the nucleic acid as a recombinant vector. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with any nucleic acid disclosed herein, including the entire AAV5 genome and 
any unique fragment thereof, including the Rep and capsid encoding sequences (e.g. SEQ ID 
NOS: 23, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, and 45). Specifically, the 

25 nucleic acid can selectively or specifically hybridize to an isolated nucleic acid consisting of 
the nucleotide sequence set forth in SEQ ID NO: 23 (AAV5 genome). The present invention 
further provides an isolated nucleic acid that selectively or specifically hybridizes with an 
isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 23 
(AAV5 genome). By "selectively hybridizes" as used herein is meant a nucleic acid that 

30 hybridizes to one of the disclosed nucleic acids under sufficient stringency conditions 
without significant hybridization to a nucleic acid encoding an unrelated protein, and 
particularly, without detectably hybridizing to nucleic acids of AAV2. Thus, a nucleic acid 
that selectively hybridizes with a nucleic acid of the present invention will not selectively 
hybridize under stringent conditions with a nucleic acid encoding a different protein or the 

35 corresponding protein from a different serotype of the virus, and vice versa. A "specifically 
hybridizing" nucleic acid is one that hybridizes under stringent conditions to only a nucleic 
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5 acid found in AAV5. Therefore, nucleic acids for use, for example, as primers and probes to 
detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 
that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 
probes for further hybridization or for amplification methods (e.g., polymerase chain 
reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 

10 can be designed that selectively hybridizes with both AAV5 and a gene of interest carried 
within the AAV5 vector (i.e., a chimeric nucleic acid). 

A nucleic acid that selectively hybridizes to any portion of the AAV5 genome is 
contemplated herein. Therefore, a nucleic acid that selectively hybridizes to AAV5 can be of 
longer length than the AAV5 genome, it can be about the same length as the AAV5 genome 

15 or it can be shorter than the AAV5 genome. The length of the nucleic acid is limited on the 
shorter end of the size range only by its specificity for hybridization to AAV5, i.e., once it is 
too short, typically less than about 5 to 7 nucleotides in length, it will no longer bind 
specifically to AAV5, but rather will hybridize to numerous background nucleic acids. 
Additionally contemplated by this invention is a nucleic acid that has a portion that 

20 specifically hybridizes to AAV5 and a portion that specifically hybridizes to a gene of 
interest inserted within AAV5. 

The present invention further provides an isolated nucleic acid encoding an adeno- 
associated virus 5 Rep protein. The AAV5 Rep proteins are encoded by open reading frame 
(ORF) 1 of the AAV5 genome. Examples of the AAV5 Rep genes are shown in the nucleic 

25 acid set forth in SEQ ID NO: 23, and include nucleic acids consisting essentially of the 

nucleotide sequences set forth in SEQ ID NOS: 32 (Rep52), 33 (Rep78), 35 (Rep40), and 37 
(Rep68), and nucleic acids comprising the nucleotide sequences set forth in SEQ ID NOS: 
32, 33, 35, and 37. However, the present invention contemplates that the Rep nucleic acid 
can include any one, two, three, or four of the four Rep proteins, in any order, in such a 

30 nucleic acid. Furthermore, minor modifications are contemplated in the nucleic acid, such as 
silent mutations in the coding sequences, mutations that make neutral or conservative 
changes in the encoded amino acid sequence, and mutations in regulatory regions that do not 
disrupt the expression of the gene. Examples of other minor modifications are known in the 
art. Further modifications can be made in the nucleic acid, such as to disrupt or alter 

35 expression of one or more of the Rep proteins in order to, for example, determine the effect 
of such a disruption; such as to mutate one or more of the Rep proteins to determine the 
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5 resulting effect, etc. However, in general, a modified nucleic acid encoding a Rep protein 
will have at least about 85%, about 90%, about 93%, about 95%, about 98% or 100% 
homology to the Rep nucleic sequences described herein e.g., SEQ ID NOS: , 11, 13 and 15 
32, 33, 35 and 37, and the Rep polypeptide encoded therein will have overall about 93%, 
about 95%, about 98%, about 99% or 100% homology with the amino acid sequence 

10 described herein, e.g., SEQ ID NOS: 24, 25, 34 and 36. Percent homology is determined by 
the techniques described herein. 

The present invention also provides an isolated nucleic acid that selectively or 
specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set forth in SEQ ID NOS: 32, 33, 35 and 37, and an isolated nucleic acid that selectively 

15 hybridizes'wiui a nucleic acid comprising the nucleotide sequence set forth in SEQ ID NOS: 
32, 33, 35 and 37. "Selectively hybridizing" and "stringency of hybridization" is defined 
elsewhere herein. 

As described above, provided is the nucleic acid encoding a Rep 40 protein and, in 
particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 

20 NO: 35, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 35, and a nucleic acid encoding the adeno-associated virus 5 protein having the 
amino acid sequence set forth in SEQ ID NO: 34. The present invention also provides the 
nucleic acid encoding a Rep 52 protein, and in particular an isolated nucleic acid comprising 
the nucleotide sequence set forth in SEQ ID NO: 32, an isolated nucleic acid consisting 

25 essentially of the nucleotide sequence set forth in SEQ ID NO: 32, and a nucleic acid 

encoding the adeno-associated virus 5 Rep protein having the amino acid sequence set forth 
in SEQ ID NO: 24. The present invention further provides the nucleic acid encoding a Rep 
68 protein and, in particular an isolated nucleic acid comprising the nucleotide sequence set 
forth in SEQ ID NO: 37, an isolated nucleic acid consisting essentially of the nucleotide 

30 sequence set forth in SEQ ID NO: 37, and a nucleic acid encoding the adeno-associated 
virus 5 protein having the amino acid sequence set forth in SEQ ID NO: 36. And, further, 
provided is the nucleic acid encoding a Rep 78 protein, and in particular an isolated nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 33, an isolated nucleic 
acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 33, and a 

35 nucleic acid encoding the adeno-associated virus 5 Rep protein having the amino acid 
sequence set forth in SEQ ID NO: 25. As described elsewhere herein, these nucleic acids 
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» ... 

can have minor modifications, including silent nucleotide substitutions, mutations causmg 
conservative amino acid substitutions in the encoded proteins, and mutations in control 
regions that do not or niinimally affect the encoded amino acid sequence. 

The present invention further provides a nucleic acid encoding the entire AAV5 
Capsid polypeptide. Furthermore, provided is a nucleic acid encoding each of the three 
AAV5 coat proteins, VP1, VP2, and VP3. Thus, provided is a nucleic acid encoding AAV5 
VP1, a nucleic acid encoding AAV5 VP2, and a nucleic acid encoding AAV5 VP3. Thus, 
provided is a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 26 
(VP1); a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 27(VP2), 
and a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 28 (VP3). 
The present invention also specifically provides a nucleic acid comprising SEQ ID NO: 29 
(VP1 gene); a nucleic acid comprising SEQ ID NO: 30 (VP2 gene); and a nucleic acid 
comprising SEQ ID NO: 31 (VP3 gene). The present invention also specifically provides a 
nucleic acid consisting essentially of SEQ ID NO: 29 (VP1 gene), a nucleic acid consisting 
essentially of SEQ ID NO: 30 (VP2 gene), and a nucleic acid consisting essentially of SEQ 
ID NO: 31 (VP3 gene). Minor modifications in the nucleotide sequences encoding the 
capsid, or coat, proteins are contemplated, as described above for other AAV5 nucleic acids. 
However, in general, a modified nucleic acid encoding a capsid protein will have at least 
about 85%, about 90%, about 93%, about 95%, about 98% or 100% homology to the capsid 
nucleic sequences described herein e.g., SEQ ID NOS: 29, 30 and 31, and the capsid 
polypeptide encoded therein will have overall about 93%, about 95%, about 98%, about 
99% or 100% homology with the amino acid sequence described herein, e.g., SEQ ID NOS: 
26, 27, and 28. Nucleic acids that selectively hybridize with the nucleic acids of SEQ ID 
NOS: 29, 30, and 31 under the conditions described above are also provided. 

Provided is an isolated AAV5 Rep protein. An AAV5 Rep polypeptide is encoded 
by ORF1 of AAV5. The present invention also provides each individual AAV5 Rep protein. 
Thus provided is AAV5 Rep 40 (e.g., SEQ ID NO: 34), or a unique fragment thereof. 
Provided is AAV5 Rep 52 (e.g., SEQ ID NO: 24), or a unique fragment thereof. Provided is 
AAV5 Rep 68 (e.g., SEQ ID NO: 36), or a unique fragment thereof. Provided is an example 
of AAV5 Rep 78 (e.g., SEQ ID NO: 25), or a unique fragment thereof. By "unique fragment 
thereof is meant any smaller polypeptide fragment encoded by an AAV5 rep gene that is of 
sufficient length to be found only in the Rep polypeptide. Substitutions and modifications of 



39 



WO 2006/029196 PCT/US2005/031837 

5 the amino acid sequence can be made as described above and, further, can include protein 
processing modifications, such as glycosylation, to the polypeptide. 

The present invention further provides an AAV5 Capsid polypeptide or a unique 
fragment thereof. AAV5 capsid polypeptide is encoded by ORF 2 of AAV5. The present 
invention further provides the individual AAV5 capsid proteins, VP1, VP2 and VP3 or 

10 unique fragments thereof. Thus, provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO: 26 (VP1). The present invention additionally provides an 
isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 27 (VP2). The 
present invention also provides an isolated polypeptide having the amino acid sequence set 
forth in SEQ ID NO: 28 (VP3). By "unique fragment thereof is meant any smaller 

15 polypeptide fragment encoded by any AAV5 capsid gene that is of sufficient length to be 
found only in the AAV5 capsid protein. Substitutions and modifications of the amino acid 
sequence can be made as described above and, further, can include protein processing 
modifications, such as glycosylation, to the polypeptide. However, an AAV5 Capsid 
polypeptide including all three coat proteins will have greater than about 56% overall 

20 homology to the polypeptide encoded by the nucleotides set forth in SEQ ID NOS: 26, 27, 
or 28. The protein can have about 65%, about 70%, about 75%, about 80%, about 85%, 
about 90%, 93%, 95%, 97% or even 100% homology to the amino acid sequence encoded 
by the nucleotides set forth in SEQ ID NOS: 26, 27 or 28. An AAV5 VP1 polypeptide can 
have at least about 58%, about 60%, about 70%, about 80%, about 90%, 93%, 95%, 97% or 

25 about 100% homology to the amino acid sequence set forth in SEQ ID NO: 26. An AAV5 
VP2 polypeptide can have at least about 58%, about 60%, about 70%, about 80%, about 
90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth in SEQ 
ID NO: 27. An AAV5 VP3 polypeptide can have at least about 60%, about 70%, about 
80%, about 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set 

30 forth in SEQ ID NO: 28. 

The AAV ITRs in the vector for the herein described delivery methods can be AAV5 
ITRs (SEQ ID NOS: 41 and 42). Furthermore, the AAV ITRs in the vector for the herein 
described nucleic acid delivery methods can also comprise AAV1, AAV2, AAV3, AAV4, or 
AAV6 inverted terminal repeats. 



40 



WO 2006/029196 PCT/US2005/031837 

5 Compositions and methods for making BAAV vectors 

Compositions and methods for making and using BAAV vectors have been 
previously described in U.S. Patent Application No. 60/526786, filed December 4, 2003, 
and in International Patent Application No. PCT/US04/40825, filed December 6, 2004, 
which are hereby incorporated herein by reference for this teaching. 

10 Provided is a recombinant bovine adeno-associated virus (BAAV). This virus has 

one or more of the characteristics described below. The compositions of the present 
invention do not include wild-type BAAV. The methods of the present invention can use 
either wild-type BAAV or recombinant BAAV-based delivery. 

Provided are novel BAAV particles, recombinant BAAV vectors and recombinant 

1 5 BAAV virions. An BAAV particle is a viral particle comprising an BAAV capsid protein. A 
recombinant BAAV vector is a nucleic acid construct that comprises at least one unique 
nucleic acid of BAAV. A recombinant BAAV virion is a particle containing a recombinant 
BAAV vector, wherin the particle can be either an BAAV particle as described herein or a 
non-BAAV particle. Alternatively, the recombinant BAAV virion is an BAAV particle 

20 containing a recombinant vector, wherein the vector can be either an BAAV vector as 

described herein or a non-BAAV vector. These vectors, particles, virions, nucleic acids and 
polypeptides are described below. 

Provided is the nucleotide sequence of the BAAV genome and vectors and particles 
derived therefrom. Specifically, provided is a nucleic acid vector comprising a pair of 

25 BAAV inverted terminal repeats (ITRs) and a promoter between the inverted terminal 
repeats. The rep proteins of AAV5 and BAAV will bind to the BAAV ITR and it can 
function as an origin of replication for packaging of recombinant AAV particles. The 
rninimum sequence necessary for this activity is the TRS site where Rep cleaves in order to 
replicate the virus. Minor modifications in an ITR are contemplated and are those that will 

30 not interfere with the hairpin structure formed by the ITR as described herein and known in 
the art. Furthermore, to be considered within the term e.g. it must retain the Rep binding site 
described herein. 

The D- region of the AAV2 ITR, a single stranded region of the ITR, inboard of the 
TRS site, has been shown to bind a factor which depending on its phosphorylation state 
35 correlates with the conversion of the AAV from a single stranded genome to a 

transcriptionally active form that allows for expression of the viral DNA. This region is 
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5 conserved between AAV2, 3, 4,and 6 but is divergent in AAV5 and BAAV (SEQ ID NO: 
59). The D+ region is the reverse complement of the D- region. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. That is, the promoter can be tissue/cell-specific. 

1 0 Promoters can be prokaryotic, eukaryotic, fungal, nuclear, mitochondrial, viral or plant 

promoters. Promoters can be exogenous or endogenous to the cell type being transduced by 
the vector. Promoters can include, for example, bacterial promoters, known strong 
promoters such as SV40 or the inducible metallotbionein promoter, or an AAV promoter, 
such as an AAV p5 promoter. Additionally, chimeric regulatory promoters for targeted gene 

1 5 expression can be utilized. Examples of these regulatory systems, which are known in the 
art, include the tetracycline based regulatory system which utilizes the tet transactivator 
protein (tTA), a chimeric protein containing the VP16 activation domain fused to the tet 
repressor of Escherichia coli, the IPTG based regulatory system, the CID based regulatory 
system, and the Ecdysone based regulatory system. Other promoters include promoters 

20 derived from actin genes, immunoglobulin genes, cytomegalovirus (CMV), adenovirus, 
bovine papilloma virus, adenoviral promoters, such as the adenoviral major late promoter, 
an inducible heat shock promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), 
etc., specifically, the promoter can be AAV2 p5 promoter or AAV5 p5 promoter or BAAV 
p5 promoter. More specifically, the BAAV p5 promoter can be in about the same location in 

25 SEQ ID NO: 47 as the AAV2 p5 promoter, in the corresponding AAV2 published sequence. 
Additionally, the p5 promoter may be enhanced by nucleotides 1-173 of SEQ ID NO: 47. 
Furthermore, smaller fragments of p5 promoter that retain promoter activity can readily be 
determined by standard procedures including, for example, constructing a series of deletions 
in the p5 promoter, linking the deletion to a reporter gene, and determining whether the 

30 reporter gene is expressed, i.e., transcribed and/or translated. The promoter can be the 
promoter of any of the AAV serotypes, and can be the pl9 promoter (SEQ ID NO: 62) or 
the p40 promoter set forth in the sequence listing as SEQ ID NO: 63. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 
herein can be corrected, for example, by using the hybridization procedure described below 

35 with various probes derived from the described sequences such that the coding sequence can 
be reisolated and resequenced. Rapid screening for point mutations can also be achieved 
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with the use of polymerase chain reaction single strand conformation polymorphism (PCR 
SSCP). The corresponding amino acid sequence can then be corrected accordingly. 

The BAAV-derived vector can include any normally occurring BAAV nucleic acid 
sequences in addition to an TTR and promoter. The BAAV-derived vector can also include 
sequences that are at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
the BAAV nucleic acids set forth herein. Examples of vector constructs are provided below. 

The present vector or BAAV particle or recombinant BAAV virion can utilize any 
unique fragment of these present BAAV nucleic acids, including the BAAV nucleic acids 
set forth in SEQ ID NOS: 47, 48, 50, 52, 54, 56 and 58-63. To be unique, the fragment must 
be of sufficient size to distinguish it from other known sequences, most readily determined 
by comparing any nucleic acid fragment to the nucleotide sequences of nucleic acids in 
computer databases, such as GenBank. Such comparative searches are standard in the art. 
Typically, a unique fragment useful as a primer or probe will be at least about 8 or 10, 
preferable at least 20 or 25 nucleotides in length, depending upon the specific nucleotide 
content of the sequence. Additionally, fragments can be, for example, at least about 30, 40, 
50, 75, 1 00, 200 or 500 nucleotides in length and can encode polypeptides or be probes. The 
nucleic acid can be single or double stranded, depending upon the purpose for which it is 
intended. Where desired, the nucleic acid can be RNA 

The present invention further provides a BAAV capsid protein to contain the vector. 
In particular, provided is not only a polypeptide comprising all three BAAV coat proteins, 
i.e., VP1, VP2 and VP3, but also a polypeptide comprising each BAAV coat protein 
individually, SEQ ID NOS: 53, 55, and 57, respectively. Thus, an BAAV particle 
comprising an BAAV capsid protein comprises at least one BAAV coat protein VP1, VP2 
or VP3. A BAAV particle comprising an BAAV capsid protein can be utilized to deliver a 
nucleic acid vector to a cell, tissue or subject. For example, the herein described BAAV 
vectors can be encapsidated in an AAV5 capsid-derived particle and utilized in a gene 
delivery method. Furthermore, other viral nucleic acids can be encapsidated in the BAAV 
particle and utilized in such delivery methods. For example, an AAV1-8 or AAAV vector 
(e.g. AAV1-8 or AAAV ITR and nucleic acid of interest) can be encapsidated in an BAAV 
particle and admniistereA Furthermore, a BAAV chimeric capsid incorporating both AAV1- 
8 or AAAV capsid and BAAV capsid sequences can be generated, by standard cloning 
methods, selecting regions from the known sequences of each protein as desired. For 
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5 example, particularly antigenic regions of the BAAV capsid protein can be replaced with the 
corresponding region of the BAAV capsid protein. In addition to chimeric capsids 
incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3-8, and 
AAV5 capsid sequences can be generated, by standard cloning methods, selecting regions 
from the known sequences of each protein as desired. Alternatively a chimeric capsid can be 

1 0 made by the addition of a plasmid that expresses AAV1-8 capsid proteins at a ratio with the 
BAAV capsid expression plasmid that allows only a few capsid proteins to be incorpated 
into the BAAV particle. Thus, for example, a chimeric particle may be constructed that 
contains 6 AAV2 capsid proteins and 54 BAAV capsid proteins if the complete capsid 
contains 60 capsid proteins. 

1 5 The capsids can also be modified to alter their specific tropism by genetically 

altering the capsid to encode a specific ligand to a cell surface receptor. Alternatively, the 
capsid can be chemically modified by conjugating a ligand to a cell surface receptor. By 
genetically or chemically altering the capsids, the tropism can be modified to direct BAAV 
to a particular cell or population of cells. The capsids can also be altered immunologically 

20 by conjugating the capsid to an antibody that recognizes a specific protein on the target cell 

or population of cells. 

It has been recently reported that insertion of foreign epitopes (RGD motif, LH 

receptor targeting epitope) in certain regions of AAV2 capsid can redirect viral tropism. 

However, AAV2 naturally infects a wide variety of cell types and complete retargeting of 
25 rAAV2 would be difficult to achieve. Provided are two regions in the capsid of BAAV that 

are on the virus surface and could tolerate substitution. These two regions are aa 257-264 

(GSSNASDT, SEQ ID NO:67) and aa 444-457 (TTSGGTLNQGNSAT, SEQ ID NO:68). 

Other regions of the BAAV capsid could also accommodate the substitution of amino acids 

that would allow for epitope presentation on the surface of the virus. All of these regions 
30 would have in common 1) Surface exposure 2) able to support a substitution of sequence to 

insert the epitope 3) still allow for capsid assembly. 

Because of the symmetry of the AAV particles, a substitution in one subunit of the 

capsid will appear multiple times on the capsid surface. For example the capsid is made of 

approximately 55 VP3 proteins. Therefore an epitope incorporated in the VP3 protein could 
35 be expressed 55 times on the surfece of each particle increasing the likelihood of the epitope 

forming a stable interaction with its target. In some cases this maybe too high of a ligand 
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5 density for functional binding or this high density of epitope may interfere with capsid 
formation. The epitope density could be lowered by introducing another plasmid into the 
packaging system for production of recombinant particles and the ratio between the 
packaging plasmid with the modified VP3 protein and the wt VP3 protein altered to balance 
the epitope density on the virus surface. 

10 Epitopes could be incorporated into the virus capsid for the purpose of 1) altering the 

tropism of the virus 2) blocking an immune response direct at the virus 3) developing a host 
immune response to the epitope for the purpose of vaccination. 

Examples of epitopes that could be added to BAAV capsids include but are not 
limited to: 

1 5 LH receptor binding epitope 

RGD integrin binding epitope 
CD13 binding epitope NGRAHA SEQ H> NO:69 
The Retanef polyprotein vaccine candidate for HIV-1 
single chain antibody fragments directed against tumor cells 
20 Endothelial cell binding epitope SIGYPLP SEQ ID NO:70 

serpin receptor ligand, KFNKPFVFLI SEQ ID NO:71 
protective B-cell epitope hemagglutinin (HA) 91-108 from influenza HA 
NDV B-cell immunodominant epitope (IDE) spanning residues 447 to 455 
Major immunogenic epitope for parvovirus B19 ( NISLDNPLENPSSLFDLVARIK, 
25 SEQ ID NO:72) that can elicit protective antibody titers. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 
BAAV particle comprising BAAV capsid proteins and also full particles. 
30 The herein described recombinant BAAV nucleic acid derived vector can be 

encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle or an AAV6 or 
AAV7 or an AAV8 or AAAV particle, a portion of any of these capsids, or a chimeric 
capsid particle as described above, by standard methods using the appropriate capsid 
35 proteins in the encapsidation process, as long as the nucleic acid vector fits within the size 
limitation of the particle utilized. The encapsidation process itself is standard in the art. The 
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BAAV replication machinery, i.e. the rep initiator proteins and other functions required for 
replication, can be utilized to produce the BAAV genome that can be packaged in an 
AAV1-8 or AAAV capsid. 

The recombinant BAAV virion containing a vector can also be produced by 
recombinant methods utilizing multiple plasmids. In one example, the BAAV rep nucleic 
acid would be cloned into one plasmid, the BAAV ITR nucleic acid would be cloned into 
another plasmid and the AAV1-8 capsid nucleic acid would be cloned on another plasmid. 
These plasmids would then be introduced into ceUs. The cells that were efficiently 
transduced by all three plasmids, would exhibit specific integration as well as the ability to 
produce BAAV recombinant virus. Additionally, two plasmids could be used where the 
BAAV rep nucleic acid would be cloned into one plasmid and the BAAV ITR and BAAV 
capsid would be cloned into another plasmid. These plasmids would then be introduced into 
cells. The cells that were efficiently transduced by both plasmids, would exhibit specific 
integration as well as the ability to produce BAAV recombinant virus. 

An BAAV capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide 
can overall have greater than 56% homology to the polypeptide having the amino acid 
sequence encoded by nucleotides in SEQ ID NOS: 52, 54 and 56. The capsid protein can 
have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by the nucleotides set forth in SEQ ID 
NOS: 52, 54 and 56. The percent homology used to identify proteins herein, can be based on 
a nucleotide-by-nucleotide comparison or more preferable is based on a computerized 
algorithm as described herein. Variations in the amino acid sequence of the BAAV capsid 
protein are contemplated herein, as long as the resulting particle comprising an BAAV 
capsid protein remains antigenically or immunologically distinct from AAV1-8 or AAAV 
capsid, as can be routinely determined by standard methods. Specifically, for example, 
ELISA and Western blots can be used to determine whether a viral particle is antigenically 
or immunologically distinct from AAV2 or the other serotypes. Furthermore, the BAAV 
particle preferably retains tissue tropism distinction from other AAVs, such as that 
exemplified in the examples herein. A BAAV chimeric particle comprising at least one 
BAAV coat protein may have a different tissue tropism from that of an BAAV particle 
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consisting only of BAAV coat proteins, but is still distinct from the tropism of an AAV2 
particle. 

The invention further provides a recombinant BAAV virion, comprising a BAAV 
particle containing, i.e., encapsidating, a vector comprising a pair of BAAV inverted 
terminal repeats. The recombinant vector can further comprise a BAAV Rep-encoding 
nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. 

The invention further contemplates chimeric recombinant ITRs that contain a rep 
binding site and a TRS site recognized by that Rep protein. By "Rep protein" is meant all 
four of the Rep proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" 
could be one or more of the Rep proteins described herein. One example of a chimeric ITR 
would consist of an BAAV D region (SEQ ID NO: 59), an BAAV TRS site (SEQ ID NO: 
60), an AAV2 hairpin and an AAV2 Rep binding site. Another example would be a BAAV 
D region, an BAAV TRS site, an AAV3 hairpin and an AAV3 Rep binding site. In these 
chimeric ITRs, the D region can be from AAV1-8 or AAAV. The hairpin can be derived 
from AAV 1-8 or AAAV. The binding site can be derived from any of AAV1-8 or AAAV. 
Preferably, the D region and the TRS are from the same serotype. 

The chimeric ITRs can be combined with BAAV Rep protein and any of the AAV 
serotype capsids to obtain recombinant virion. For example, recombinant virion can be 
produced by a BAAV D region, an BAAV TRS site, an AAV2 hairpin, an AAV2 binding 
site, BAAV Rep protein and AAV1 capsid. This recombinant virion would possess the 
cellular tropism conferred by the AAV1 capsid protein and would possess the efficient 
replication conferred by the BAAV Rep. 

Other examples of the ITR Rep protein and Capsids that will produce recombinant 
virus are provided in the list below but not limited to : 

BAAV ITR + BAAV Rep + BAAV Cap=virus 

AAV5 ITR + BAAV Rep + BAAV Cap=virus 

AAV5 ITR + BAAV Rep + AAV1 Cap=virus 

AAV5 ITR + BAAV Rep + AAV2 Cap=virus 

AAV5 ITR + BAAV Rep + AAV3 Cap=virus 

AAV5 ITR + BAAV Rep + AAV4 Cap=virus 

AAV5 ITR + BAAV Rep + AAV5 Cap=virus 
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5 AAV5 ITR + BAAV Rep + AAV6 Cap=virus 

AAV5 ITR + BAAV Rep + AAV7 Cap==virus 

AAV5 ITR + BAAV Rep + AAV8 Cap=virus 

BAAV ITR + AAV5 Rep + BAAV Cap=virus 

BAAV ITR + AAV5 Rep + AAV1 Cap=vims 
10 BAAV ITR + AAV5 Rep + AAV2 Cap=virus 

BAAV ITR + AAV5 Rep + AAV3 Cap=virus 

BAAV ITR + AAV5 Rep + AAV4 Cap=virus 

BAAV ITR + AAV5 Rep + AAV5 Cap=virus 

BAAV ITR + AAV5 Rep + AAV6 Cap=virus 
1 5 BAAV ITR + AAV5 Rep + AAV7 Cap=virus 

BAAV ITR + AAV5 Rep + AAV8 Cap=virus 

AAV5 ITR + AAV5 Rep + BAAV Cap=virus 

AAV1 ITR + AAV1 Rep + BAAV Cap=vinis 

AAV2 ITR + AAV2 Rep + BAAV Cap=virus 
20 AAV3 ITR + AAV3 Rep + BAAV Cap=virus 

AAV4 ITR + AAV4 Rep + BAAV Cap=virus 

AAV5 ITR + AAV5 Rep + BAAV Cap=virus 

AAV6 ITR + AAV6 Rep + BAAV Cap=virus 

AAV7 ITR + AAV7 Rep + BAAV Cap=virus 
25 AAV8 ITR + AAV8 Rep + BAAV Cap=virus 

In any of the constructs described herein, inclusion of a promoter is preferred. As 
used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 
BAAV VP1, BAAV VP2, BAAV VP3, combinations thereof, functional fragments of any 
of VP1, VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
30 described herein, can be chimeric recombinant ITRs as described elsewhere in the 
application. 

Conjugates of recombinant or wild-type BAAV virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified BAAV can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
35 to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 

molecule. Also contemplated are virosomes that contain BAAV structural proteins (BAAV 
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5 capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize the BAAV capsid or a 
unique region of the BAAV capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 
to introduce DNA into cells. For example, the BAAV VP3 protein or fragment thereof, can 

1 0 be conjugated to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected 
using the targeting ability of the VP3 capsid protein to achieve the desired tissue tropism, 
specific to BAAV. BAAV VP1 and VP2 proteins can also be utilized to introduce DNA or 
other molecules into cells. By further incorporating the Rep protein and the AAV TRS into 
the DNA-containing conjugate, cells can be transduced and targeted integration can be 

1 5 achieved. For example, if BAAV specific targeted integration is desired, a conjugate 

composed of the BAAV VP3 capsid, BAAV rep or a fragment of BAAV rep, BAAV TRS, 
the rep binding site, the heterologous DNA of interest, and a lipid, can be utilized to achieve 
BAAV specific tropism and BAAV specific targeted integration in the genome. 

Further provided by this invention are chimeric viruses where BAAV can be 

20 combined with herpes virus, baculovirus or other viruses to achieve a desired tropism 

associated with another virus. For example, the BAAV ITRs could be inserted in the herpes 
virus and cells could be infected. Post-infection, the ITRs of BAAV could be acted on by 
BAAV rep provided in the system or in a separate vehicle to rescue BAAV from the 
genome. Therefore, the cellular tropism of the herpes simplex virus can be combined with 

25 BAAV rep mediated targeted integration. Other viruses that could be utilized to construct 
chimeric viruses include lentivirus, retrovirus, pseudotyped retroviral vectors, and 
adenoviral vectors. 

The present invention further provides isolated nucleic acids of BAAV. For 
example, provided is an isolated nucleic acid comprising the nucleotide sequence set forth 

30 in SEQ ID NO: 47 (BAAV genome). This nucleic acid, or portions thereof, can be inserted 
into vectors, such as plasmids, yeast artificial chromosomes, or other viral vector (particle), 
if desired, by standard cloning methods. The present invention also provides an isolated 
nucleic acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 47. 
The nucleotides of SEQ ID NO: 47 can have minor modifications and still be contemplated 

35 by the present invention. For example, modifications that do not alter the amino acid 

encoded by any given codon (such as by modification of the third, 'Vobble," position in a 
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5 codon) can readily be made, and such alterations are known in the art. Furthermore, 

modifications that cause a resulting neutral (conserved) amino acid substitution of a similar 
amino acid can be made in a coding region of the genome. Additionally, modifications as 
described herein for the BAAV components, such as the ITRs, the p5 promoter, etc. are 
contemplated in this invention. Furthermore, modifications to regions of SEQ ID NO:-t 47 

1 0 other than in the ITR, TRS, Rep binding site and hairpin are likely to be tolerated without 
serious impact on the function of the nucleic acid as a recombinant vector. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with any nucleic acid disclosed herein, including the entire BAAV genome and 
any unique fragment thereof, including the Rep and capsid encoding sequences (e.g. SEQ ID 

1 5 NOS: 47, 48, 50, 52, 54, 56, 58, 59, 60, 61, 62, 63). Specifically, the nucleic acid can 

selectively or specifically hybridize to an isolated nucleic acid consisting of the nucleotide 
sequence set forth in SEQ ID NO: 47 (BAAV genome). The present invention further 
provides an isolated nucleic acid that selectively or specifically hybridizes with an isolated 
nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 47 (BAAV 

20 genome). By "selectively hybridizes" as used herein is meant a nucleic acid that hybridizes 
to one of the disclosed nucleic acids under sufficient stringency conditions without 
significant hybridization to a nucleic acid encoding an unrelated protein, and particularly, 
without detectably hybridizing to nucleic acids of AAV2. Thus, a nucleic acid that 
selectively hybridizes with a nucleic acid of the present invention will not selectively 

25 hybridize under stringent conditions with a nucleic acid encoding a different protein or the 
corresponding protein from a different serotype of the virus, and vice versa. A "specifically 
hybridizing" nucleic acid is one that hybridizes under stringent conditions to only a nucleic 
acid found in BAAV. Therefore, nucleic acids for use, for example, as primers and probes to 
detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 

30 that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 
probes for further hybridization or for amplification methods (e.g., polymerase chain 
reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both BAAV and a gene of interest carried 
within the BAAV vector (i.e., a chimeric nucleic acid). 

35 A nucleic acid that selectively hybridizes to any portion of the BAAV genome is 

contemplated herein. Therefore, a nucleic acid that selectively hybridizes to BAAV can be 
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5 of longer length than the BAAV genome, it can be about the same length as the B AAV 
genome or it can be shorter than the BAAV genome. The length of the nucleic acid is 
limited on the shorter end of the size range only by its specificity for hybridization to 
BAAV, i.e., once it is too short, typically less than about 5 to 7 nucleotides in length, it will 
no longer bind specifically to BAAV, but rather will hybridize to numerous background 

1 0 nucleic acids. Additionally contemplated by this invention is a nucleic acid that has a 

portion that specifically hybridizes to BAAV and a portion that specifically hybridizes to a 
gene of interest inserted within BAAV. 

The present invention further provides an isolated nucleic acid encoding a bovine 
adeno-associated virus Rep protein. The BAAV Rep proteins are encoded by open reading 

15 frame (ORF) 1 of the BAAV genome. Examples of the BAAV Rep genes are shown in the 
nucleic acid set forth in SEQ ID NO: 47, and include nucleic acids consisting essentially of 
the nucleotide sequences set forth in SEQ ID NOS: 48 (rep78), 4(rep52) and nucleic acids 
comprising the nucleotide sequences set forth in SEQ ID NOS: 48 and 50. However, the 
present invention contemplates that the Rep nucleic acid can include any one, two, three, or 

20 four of the four Rep proteins, in any order, in such a nucleic acid. Furthermore, minor 
modifications are contemplated in the nucleic acid, such as silent mutations in the coding 
sequences, mutations that make neutral or conservative changes in the encoded amino acid 
sequence, and mutations in regulatory regions that do not disrupt the expression of the gene. 
Examples of other minor modifications are known in the art. Further modifications can be 

25 made in the nucleic acid, such as to disrupt or alter expression of one or more of the Rep 
proteins in order to, for example, determine the effect of such a disruption; such as to mutate 
one or more of the Rep proteins to determine the resulting effect, etc. However, in general, a 
modified nucleic acid encoding a Rep protein will have at least about 85%, about 90%, 
about 93%, about 95%, about 98% or 100% homology to the Rep nucleic sequences 

30 described herein e.g., SEQ ID NOS: 48 and 50, and the Rep polypeptide encoded therein 
will have overall about 93%, about 95%, about 98%, about 99% or 100% homology with 
the amino acid sequence described herein, e.g., SEQ ID NOS: 49 and 51. Percent homology 
is determined by the techniques described herein. 

The present invention also provides an isolated nucleic acid that selectively or 

35 specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set forth in SEQ ID NOS: 48 and 50, and an isolated nucleic acid that selectively hybridizes 



51 



WO 2006/029196 PCT/US2005/031837 

5 with a nucleic acid comprising the nucleotide sequence set forth in SEQ ID NOS: 48 and 50. 
"Selectively hybridizing" and "stringency of hybridization" is defined elsewhere herein. 

As described above, provided is the nucleic acid encoding a Rep 78 protein and, in 
particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 48, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 

10 SEQ ID NO: 48, and a nucleic acid encoding the bovine adeno-associated virus protein 
having the amino acid sequence set forth in SEQ ID NO: 49. The present invention also 
provides the nucleic acid encoding a Rep 52 protein, and in particular an isolated nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 50, an isolated nucleic 
acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 50, and a 

1 5 nucleic acid encoding the bovine adeno-associated virus Rep 52 protein having the amino 
acid sequence set forth in SEQ ID NO: 51. As described elsewhere herein, these nucleic 
acids can have minor modifications, including silent nucleotide substitutions, mutations 
causing conservative amino acid substitutions in the encoded proteins, and mutations in 
control regions that do not or minimally affect the encoded amino acid sequence. 

20 The present invention further provides a nucleic acid encoding the entire BAAV 

Capsid polypeptide. Furthermore, provided is a nucleic acid encoding each of the three 
BAAV coat proteins, VP1, VP2, and VP3. Thus, provided is a nucleic acid encoding BAAV 
VP1, a nucleic acid encoding BAAV VP2, and a nucleic acid encoding BAAV VP3. Thus, 
provided is a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 53 

25 (VP1); a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 55 (VP2), 
and a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 57 (VP3). 
The present invention also specifically provides a nucleic acid comprising SEQ ID NO: 52 
(VP1 gene); a nucleic acid comprising SEQ ID NO: 54 (VP2 gene); and a nucleic acid 
comprising SEQ ID NO: 56 (VP3 gene). The present invention also specifically provides a 

30 nucleic acid consisting essentially of SEQ ID NO: 52 (VP1 gene), a nucleic acid consisting 
essentially of SEQ ID NO: 54 (VP2 gene), and a nucleic acid consisting essentially of SEQ 
ID NO: 56 (VP3 gene). Minor modifications in the nucleotide sequences encoding the 
capsid, or coat, proteins are contemplated, as described above for other BAAV nucleic 
acids. However, in general, a modified nucleic acid encoding a capsid protein will have at 

35 least about 85%, about 90%, about 93%, about 95%, about 98% or 100% homology to the 
capsid nucleic sequences described herein e.g., SEQ ID NOS: 52, 54 and 56, and the capsid 
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5 polypeptide encoded therein will have overall about 93%, about 95%, about 98%, about 
99% or 100% homology with the amino acid sequence described herein, e.g., SEQ ID NOS: 
53, 55 and 57. Nucleic acids that selectively hybridize with the nucleic acids of SEQ ID 
NOS: 52, 54 and 56 under the conditions described above are also provided. 

Provided is an isolated BAAV Rep protein. An BAAV Rep polypeptide is encoded 

10 by ORF1 of BAAV. The present invention also provides each individual BAAV Rep 
protein. Thus provided is BAAV Rep 52 (e.g., SEQ ID NO: 50), or a unique fragment 
thereof. Provided is BAAV Rep 78 (e.g., SEQ ID NO: 48), or a unique fragment thereof. By 
"unique fragment thereof 1 is meant any smaller polypeptide fragment encoded by an BAAV 
rep gene that is of sufficient length to be found only in the Rep polypeptide. Substitutions 

1 5 and modifications of the. amino acid sequence can be made as described above and, further, 
can include protein processing modifications, such as glycosylation, to the polypeptide. 

The present invention further provides a BAAV Capsid polypeptide or a unique 
fragment thereof. BAAV capsid polypeptide is encoded by ORF 2 of BAAV. The present 
invention further provides the individual BAAV capsid proteins, VP1, VP2 and VP3 or 

20 unique fragments thereof. Thus, provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO:52 (VP1). The present invention additionally provides an 
isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 54 (VP2). The 
present invention also provides an isolated polypeptide having the amino acid sequence set 
forth in SEQ ID NO:56 (VP3). By "unique fragment thereof is meant any smaller 

25 polypeptide fragment encoded by any BAAV capsid gene that is of sufficient length to be 
found only in the BAAV capsid protein. Substitutions and modifications of the amino acid 
sequence can be made as described above and, further, can include protein processing 
modifications, such as glycosylation, to the polypeptide. However, an BAAV Capsid 
polypeptide including all three coat proteins will have greater than about 56% overall 

30 homology to the polypeptide encoded by the nucleotides set forth in SEQ ID NOS: 52, 54 or 
56. The protein can have about 65%, about 70%, about 75%, about 80%, about 85%, about 
90%, 93%, 95%, 97% or even 100% homology to the amino acid sequence encoded by the 
nucleotides set forth in SEQ ID NOS: 52, 54 or 56. An BAAV VP1 polypeptide can have at 
least about 58%, about 60%, about 70%, about 80%, about 90%, 93%, 95%, 97% or about 

35 100% homology to the amino acid sequence set forth in SEQ ID NO: 53. An BAAV VP2 
polypeptide can have at least about 58%, about 60%, about 70%, about 80%, about 90%, 
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5 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth in SEQ ID 
NO: 55. An BAAV VP3 polypeptide can have at least about 60%, about 70%, about 80%, 
about 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth 
in SEQ ID NO: 57. 

The present invention also provides a method of producing the BAAV virus by 

1 0 transducing a cell with the nucleic acid encoding the virus. 

The present method further provides a method of delivering an exogenous 
(heterologous) nucleic acid to a cell comprising adniinistering to the cell an BAAV particle 
containing a vector comprising the nucleic acid inserted between a pair of AAV inverted 
terminal repeats, thereby delivering the nucleic acid to the cell. 

15 The AAV ITRsinthe vector for the herein described delivery methods can be AAV 

ITRs (SEQ ID NOS: 58). Furthermore, the AAV ITRs in the vector for the herein described 
nucleic acid delivery methods can also comprise AAV1-8 or AAAV inverted terminal 
repeats. 

Compositions and methods for making AAV7 vectors 

20 Compositions and methods for making and using AAV7 vectors have been 

previously described in Gao GP, et al. Proc Natl Acad Sci USA. 2002 Sep 3;99(18):1 1854- 
9; U.S. Patent Application 2003/0228282; and International Patent Application No. 
PCT/US02/33630, which are hereby incorporated by reference herein for the teaching of 
compositions and method for making and using AAV7 virions, vectors, and particles. 

25 Provided is a recombinant adeno-associated virus-7 (AAV7). This virus has one or 

more of the characteristics described below. The compositions of the present invention do 
not include wild-type AAV7. The methods of the present invention can use either wild-type 
AAV7 or recombinant AAV7-based delivery. 

Provided are AAV7 particles, recombinant AAV7 vectors and recombinant AAV7 

30 virions. An AAV7 particle is a viral particle comprising an AAV7 capsid protein. A 
recombinant AAV7 vector is a nucleic acid construct that comprises at least one unique 
nucleic acid of AAV7. A recombinant AAV7 virion is a particle containing a recombinant 
AAV7 vector, wherein the particle can be either an AAV7 particle as described herein or a 
non-AAV7 particle. Alternatively, the recombinant AAV7 virion is an AAV7 particle 

35 containing a recombinant vector, wherein the vector can be either an AAV7 vector as 
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5 described herein or a non-AAV7 vector. These vectors, particles, virions, nucleic acids and 
polypeptides are described below. 

The AAV7-derived vector can include any normally occurring AAV7 nucleic acid 
sequences. The AAV7-derived vector can also include sequences that are at least 60%, 70%, 
80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the AAV7 nucleic acids set forth 

1 0 herein. Examples of vector constructs are provided below. 

The present vector or AAV7 particle or recombinant AAV7 virion can utilize any 
unique fragment of the present AAV7 nucleic acids, including the AAV7 nucleic acids set 
forth in SEQ ID NO:64. Fragments can be, for example, at least about 30, 40, 50, 75, 100, 
200 or 500 nucleotides in length. The nucleic acid can be single or double stranded, 

1 5 depending upon the purpose for which it is intended. 

The present invention further provides an AAV7 capsid protein to contain the vector. 
In particular, provided is a polypeptide comprising AAV7 capsid protein, SEQ ID NO:66. 
An AAV7 particle comprising an AAV7 capsid protein can be utilized to deliver a nucleic 
acid vector to a cell, tissue or subject. For example, the herein described AAV7 vectors can 

20 be encapsidated in an AAV5 capsid-derived particle and utilized in a gene delivery method. 
Furthermore, other viral nucleic acids can be encapsidated in the AAV7 particle and utilized 
in such delivery methods. For example, an AAV1 -6, 8, BAAV or AAAV vector (e.g. 
AAV1-6, 8, BAAV or AAAV ITR and nucleic acid of interest) can be encapsidated in an 
AAV7 particle and administered. Furthermore, a AAV7 chimeric capsid incorporating 

25 AAV1-6, 8, BAAV or AAAV capsid, and AAV7 capsid sequences can be generated, by 
standard cloning methods, selecting regions from the known sequences of each protein as 
desired. For example, particularly antigenic regions of the AAV2 capsid protein can be 
replaced with the corresponding region of the AAV7 capsid protein. In addition to chimeric 
capsids incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3-6, 

30 8, BAAV and AAV5 capsid sequences can be generated, by standard cloning methods, 
selecting regions from the known sequences of each protein as desired. Alternatively a 
chimeric capsid can be made by the addition of a plasmid that expresses AAV1 , 3-6, 8, 
BAAV or AAV5 capsid proteins at a ratio with the AAV7 capsid expression plasmid that 
allows only a few capsid proteins to be incorpated into the AAV7 particle. Thus, for 

3 5 example, a chimeric particle may be constructed that contains 6 AAV2 capsid proteins and 
54 AAV7 capsid proteins if the complete capsid contains 60 capsid proteins. 
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5 The capsids can also be assembled into empty particles by expression in mammalian, 

bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 
AAV7 particle comprising AAV7 capsid proteins and also full particles. 

The herein described recombinant AAV7 nucleic acid derived vector can be 

10 encapsidated in* an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle, an AAV6, an 
AAV8, a BAAV particle or AAAV particle, a portion of any of these capsids, or a chimeric 
capsid particle as described above, by standard methods using the appropriate capsid 
proteins in the encapsidation process, as long as the nucleic acid vector fits within the size 

15 limitation of the particle utilized. The encapsidation process itself is standard in the art. The 
AAV7 replication machinery, i.e. the rep initiator proteins and other functions required for 
replication, can be utilized to produce the AAV7 genome that can be packaged in an AAV1- 
6, 8, BAAV or AAAV capsid. 

The recombinant AAV7 virion containing a vector can also be produced by 

20 recombinant methods utilizing multiple plasmids. In one example, the AAV7 rep nucleic 
acid would be cloned into one plasmid, the AAV2 ITR nucleic acid would be cloned into 
another plasmid and the AAV7 capsid nucleic acid would be cloned on another plasmid. 
These plasmids would then be introduced into cells. The cells that were efficiently 
transduced by all three plasmids, would exhibit specific integration as well as the ability to 

25 produce AAV7 recombinant virus. Additionally, two plasmids could be used where the 
AAV7 rep nucleic acid would be cloned into one plasmid and the AAV7 ITR and AAV7 
capsid would be cloned into another plasmid. These plasmids would then be introduced into 
cells. The cells that were efficiently transduced by both plasmids, would exhibit specific 
integration as well as the ability to produce AAV7 recombinant virus. 

30 An AAV7 capsid polypeptide encoding the entire VP 1 polypeptide can overall have 

greater than 56% homology to the polypeptide having the amino acid sequence encoded by 
nucleotides in SEQ ID NO:66. The capsid protein can have about 70% homology, about 
75% homology, 80% homology, 85% homology, 90% homology, 95% homology, 98% 
homology, 99% homology, or even 100% homology to the protein having the amino acid 

35 sequence encoded by the nucleotides set forth in SEQ ID NO:66. The percent homology 
used to identify proteins herein, can be based on a nucleotide-by-nucleotide comparison or 
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more preferable is based on a computerized algorithm as described herein. Variations in the 
amino acid sequence of the AAV7 capsid protein are contemplated herein, as long as the 
resulting particle comprising an AAV7 capsid protein remains antigenically or 
immunologically distinct from AAV1-6, 8, BAAV or AAAV capsid, as can be routinely 
determined by standard methods. Specifically, for example, ELBA and Western blots can 
be used to determine whether a viral particle is antigenically or immunologically distinct 
from AAV2 or the other serotypes. Furthermore, the AAV7 particle preferably retains 
tissue tropism distinction from other AAVs. An AAV7 chimeric particle comprising at least 
one AAV7 coat protein may have a different tissue tropism from that of an AAV7 particle 
consisting only of AAV7 coat proteins, but is still distinct from the tropism of an AAV2 
particle. 

The invention further provides a recombinant AAV7 virion, comprising an AAV7 
particle containing, i.e., encapsidating, a vector comprising a pair of AAV7 inverted 
terminal repeats. The recombinant vector can further comprise an AAV7 Rep-encoding 
nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. 

For example, recombinant virion can be produced by a AAV2 ITR AAV2 Rep 
protein and AAV7 capsid. This recombinant virion would possess the cellular tropism 
conferred by the AAV7 capsid protein and would possess the efficient replication conferred 
bytheAAV2Rep. 

Other examples of the ITR, Rep protein and Capsids that will produce recombinant 
virus are provided in the list below but not limited to : 
AAV5 ITR + AAV7 Rep + AAV1 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV2 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV3 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV4 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV5 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV6 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV7 Cap=virus 
AAV5 ITR + AAV7 Rep + AAV8 Cap=virus 
AAV5 ITR + AAV7 Rep + BAAV Cap=virus 
AAV5 ITR + AAV7 Rep + AAAV Cap=virus 
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5 AAV1 ITR + AAV1 Rep + AAV7 Cap=virus 

AAV2 ITR + AAV2 Rep + AAV7 Cap=virus 

AAV3 ITR + AAV3 Rep + AAV7 Cap=virus 

AAV4 ITR + AAV4 Rep + AAV7 Cap=virus 

AAV5 ITR + AAV5 Rep + AAV7 Cap=virus 
1 0 AAV6 ITR + AAV6 Rep + AAV7 Cap=virus 

AAV8 ITR + AAV8 Rep + AAV7 Cap=virus 

BAAV ITR + BAAV Rep + AAV7 Cap=virus 

AAAV ITR + AAAV Rep + AAV7 Cap=virus 

15 In any of the constructs described herein, inclusion of a promoter is preferred. As 

used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 
AAV7 VP1, AAV7 VP2, AAV7 VP3, combinations thereof, functional fragments of any of 
VP1, VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
described herein, can be chimeric recombinant ITRs as described elsewhere in the 

20 application. 

Conjugates of recombinant or wild-type AAV7 virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified AAV7 can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 

25 molecule. Also contemplated are virosomes that contain AAV7 structural proteins (AAV7 
capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize the AAV7 capsid or a 
unique region of the AAV7 capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 

30 to introduce DNA into cells. By "unique" is meant any smaller polypeptide fragment 
encoded by any AAV7 capsid gene that is of sufficient length to be unique to the AAV7 
Capsid protein. For example, the AAV7 VP1 protein or fragment thereof, can be conjugated 
to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected using the targeting 
ability of the VP1 capsid protein to achieve the desired tissue tropism, specific to AAV7. 

35 AAV7 VP1 proteins can also be utilized to introduce DNA or other molecules into cells. By 
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5 further incorporating an AAV Rep protein and an AAV TRS into the DNA-containing 
conjugate, cells can be transduced and targeted integration can be achieved. 

The present invention further provides isolated nucleic acids of AAV7. For example, 
provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO:64. This nucleic acid, or portions thereof, can be inserted into vectors, such as plasmids, 

10 yeast artificial chromosomes, or other viral vector (particle), if desired, by standard cloning 
methods. The present invention also provides an isolated nucleic acid consisting essentially 
of the nucleotide sequence set forth in SEQ ID NO:64. The nucleotides of SEQ ID NO:64 
can have minor modifications and still be contemplated by the present invention. For 
example, modifications that do not alter the amino acid encoded by any given codon (such 

15 as by modification of the third, "wobble," position in a codon) can readily be made, and 
such alterations are known in the art. Furthermore, modifications that cause a resulting 
neutral (conserved) amino acid substitution of a similar amino acid can be made in a coding 
region of the genome. 

The present invention also provides an isolated nucleic acid that selectively or 

20 specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set form in SEQ ID NO:64, and an isolated nucleic acid that selectively hybridizes with a 
nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO:64. "Selectively 
hybridizing" and "stringency of hybridization" is defined elsewhere herein. 

The present invention further provides an isolated nucleic acid encoding a AAV7 

25 Rep protein. The AAV7 Rep proteins are encoded by open reading frame (ORF) 1 of the 
AAV7 genome. Examples of the AAV7 Rep genes are shown in the nucleic acid set forth in 
nucleotides 334-2205 of SEQ ID NO:64, and include nucleic acids consisting essentially of 
the nucleotide sequences set forth in 334-2205 of SEQ ID NO:64 (rep78). Minor 
modifications are contemplated in the nucleic acid, such as silent mutations in the coding 

30 sequences, mutations that make neutral or conservative changes in the encoded amino acid 
sequence, and mutations in regulatory regions that do not disrupt the expression of the gene. 
Examples of other minor modifications are known in the art. Further modifications can be 
made in the nucleic acid, such as to disrupt or alter expression of one or more of the Rep 
proteins in order to, for example, determine the effect of such a disruption; such as to mutate 

35 one or more of the Rep proteins to determine the resulting effect, etc. However, in general, a 
modified nucleic acid encoding a Rep protein will have at least about 85%, about 90%, 
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5 about 93%, about 95%, about 98% or 100% homology to the Rep nucleic sequences 

described herein e.g., SEQ ID NOS:65, and the Rep polypeptide encoded therein will have 
overall about 93%, about 95%, about 98%, about 99% or 100% homology with the amino 
acid sequence described in SEQ ID NO:65. Percent homology is determined by the 
techniques described herein. 

10 The present invention further provides a nucleic acid encoding the entire AAV7 

Capsid polypeptide. Thus, provided is a nucleic acid encoding the amino acid sequence set 
forth in nucleotides 2222-4435 of SEQ ID NO:64 (VP1). Minor modifications in the 
nucleotide sequences encoding the capsid, or coat, proteins are contemplated, as described 
above for other AAV7 nucleic acids. However, in general, a modified nucleic acid encoding 

1 5 a capsid protein will have at least about 85%, about 90%, about 93%, about 95%, about 
98% or 100% homology to the capsid nucleic sequences described herein e.g., nucleotides 
2222-4435 o'f SEQ ID NO:64, and the capsid polypeptide encoded therein will have overall 
about 93%, about 95%, about 98%, about 99% or 100% homology with the amino acid 
sequence described herein, e.g., SEQ ID NO:66. 

20 AAV Vector Generation 

It is understood that as discussed herein the use of the terms "homology" and 
"identity" mean the same thing as similarity. Thus, for example, if the use of the word 
homology is used to refer to two non-natural sequences, it is understood that this is not 
necessarily indicating an evolutionary relationship between these two sequences, but rather 

25 is looking at the similarity or relatedness between their nucleic acid sequences. Many of the 
methods for determining homology between two evolutionarily related molecules are 
routinely applied to any two or more nucleic acids or proteins for the purpose of measuring 
sequence similarity regardless of whether they are evolutionarily related. 

In general, it is understood that one way to define any known variants and 

30 derivatives or those that might arise, of the disclosed nucleic acids and polypeptides herein, 
is through defining the variants and derivatives in terms of homology to specific known 
sequences. In general, variants of nucleic acids and polypeptides herein disclosed typically 
haveat least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence or the 

35 native sequence. Those of skill in the art readily understand how to determine the homology 



60 



WO 2006/029196 PCTAJS2005/031837 

5 of two polypeptides or nucleic acids. For example, the homology can be calculated after 
aligning the two sequences so that the homology is at its highest level. 

Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 

10 alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search 
for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 
(1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, 
and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
Science Dr., Madison, WI; the BLAST algorithm of Tatusova and Madden FEMS 

15 Microbiol. Lett. 174: 247-250 (1999) available from the National Center for Biotechnology 
Information (http://ww.ncbi.nlm.nm^ ), or by inspection. 

The same types of homology can be obtained for nucleic acids by for example the 
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. 
Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which 

20 are herein incorporated by reference for at least material related to nucleic acid alignment. It 
is understood that any of the methods typically can be used and that in certain instances the 
results of these various methods may differ, but the skilled artisan understands if identity is 
found with at least one of these methods, the sequences would be said to have the stated 
identity. 

25 For example, as used herein, a sequence recited as having a particular percent 

homology to another sequence refers to sequences that have the recited homology as 
calculated by any one or more of the calculation methods described above. For example, a 
first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using the Zuker 

30 calculation method even if the first sequence does not have 80 percent homology to the 

second sequence as calculated by any of the other calculation methods. As another example, 
a first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using both the 
Zuker calculation method and the Pearson and Lipman calculation method even if the first 

35 sequence does not have 80 percent homology to the second sequence as calculated by the 
Smith and Waterman calculation method, the Needleman and Wunsch calculation method, 
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5 the Jaeger calculation methods, or any of the other calculation methods. As yet another 
example, a first sequence has 80 percent homology, as defined herein, to a second sequence 
if the first sequence is calculated to have 80 percent homology to the second sequence using 
each of calculation melhods (although, in practice, the different calculation methods will 
often result in different calculated homology percentages). 

1 0 Stringency of hybridization is controlled by both temperature and salt concentration 

of either orboth of the hybridization and washing steps. Typically, the stringency of 
hybridization to achieve selective hybridization involves hybridization in high ionic strength 
solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the 
melting temperature at which half of the molecules dissociate from their hybridization 

1 5 partners) followed by washing at a combination of temperature and salt concentration 
chosen so that the washing temperature is about 5°C to 20°C below the Tm. The 
temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a 
labeled nucleic acid of interest and then washed under conditions of different stringencies. 

20 Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 

hybridizations. The washing temperatures can be used as described above to achieve 
selective stringency, as is known in the art. (Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987). A preferable stringent 

25 hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous 
solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
and washing, if desired, can be reduced accordingly as the degree of complementarity 
desired is decreased, and further, depending upon the G-C or A-T richness of any area 
wherein variability is searched for. Likewise, stringency of hybridization and washing, if 

30 desired, can be increased accordingly as homology desired is increased, and further, 

depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 

In vivo administration to a human subject or an animal model can be by any of many 
standard means for administering viruses, depending upon the target organ, tissue or cell. 
35 Virus particles can be administered orally, parenterally (e.g., intravenously), by 

intramuscular injection, intrarectally, by direct tissue or organ injection, by intraperitoneal 
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5 injection, topically, transdermal^, via aerosol delivery, via the mucosa or the like. Viral 
nucleic acids (non-encapsidated) can also be administered, e.g., as a complex with cationic 
liposomes, or encapsulated in anionic liposomes. The present compositions can include 
various amounts of the selected viral particle or non-encapsidated viral nucleic acid in 
combination with a pharmaceutical^ acceptable carrier and, in addition, if desired, may 

10 include other medicinal agents, pharmaceutical agents, carriers, adjuvants, diluents, etc. 
Parental administration, if used, is generally characterized by injection. Injectables can be 
prepared in conventional forms, either as liquid solutions or suspensions, solid forms 
suitable for solution or suspension in liquid prior to injection, or as emulsions. Dosages will 
depend upon the mode of administration, the disease or condition to be treated, and the 

1 5 individual subject's condition, but will be that dosage typical for and used in administration 
of other AAV vectors, such as AAV2 vectors. Often a single dose can be sufficient; 
however, the dose can be repeated if desirable. 

Administration of a recombinant AAV virion to the cell can be accomplished by any 
means, including simply contacting the particle, optionally contained in a desired liquid 

20 such as tissue culture medium, or a buffered saline solution, with the cells. The virion can 
be allowed to remain in contact with the cells for any desired length of time, and typically 
the virion is administered and allowed to remain indefinitely. For such in vitro methods, the 
virion can be administered to the cell by standard viral transduction methods, as known in 
the art and as exemplified herein. Titers of virus to administer can vary, particularly 

25 depending upon the cell type, but will be typical of that used for AAV transduction in 
general which is well known in the art. Additionally the titers used to transduce the 
particular cells in the present examples can be utilized. 

The cells that can be transduced by the present recombinant AAV virions can 
include any desired cell, such as the following cells and cells derived from the following 

30 tissues, human as well as other mammalian tissues, such as primate, horse, sheep, goat, pig, 
dog, rat, and mouse and avian species: Adipocytes, Adenocyte, Adrenal cortex, Amnion, 
Aorta, Ascites, Astrocyte, Bladder, Bone, Bone marrow, Brain, Breast, Bronchus, Cardiac 
muscle, Cecum, Cervix, Chorion, Cochlear, Colon, Conjunctiva, Connective tissue, Cornea, 
Dermis, Duodenum, Embryonic stem cells, Endometrium, Endothelium, Endothelial cells, 

35 Epithelial tissue, Epithelial cells, Epidermis, Esophagus, Eye, Fascia, Fibroblasts, Foreskin, 
Gastric, Glial cells, Glioblast, Gonad, Hepatic cells, Histocyte, Hair cells in the inner ear, 
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5 Ileum, Intestine, small Intestine, Jejunum, Keratinocytes, Kidney, Larynx, Leukocytes, 
Lipocyte, Liver, Lung, Lymph node, Lymphoblast, Lymphocytes, Macrophages, Mammary 
alveolar nodule, Mammary gland, Mastocyte, Maxilla, Melanocytes, Mesenchymal, 
Monocytes, Mouth, Myelin, Myoblasts Nervous tissue, Neuroblast, Neurons, Neuroglia, 
Osteoblasts, Osteogenic cells, Ovary, Palate, Pancreas, Papilloma, Peritoneum, Pituicytes, 

10 Pharynx, Placenta, Plasma cells, Pleura, Prostate, Rectum, Salivary gland, Skeletal muscle, 
Skin, Smooth muscle, Somatic, Spleen, Squamous, Stem cells, Stomach, Submandibular 
gland, Submaxillary gland, Synoviocytes, Testis, Thymus, Thyroid, Trabecular Trachea, 
Turbinate, Umbilical cord, Ureter, Uterus, and vestibular hair cells. 

Stringency of hybridization is controlled by both temperature and salt concentration 

15 of either or both of the hybridization and washing steps. Typically, the stringency of 

hybridization to achieve selective hybridization involves hybridization in high ionic strength 
solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the 
melting temperature at which half of the molecules dissociate from their hybridization 
partners) followed by washing at a combination of temperature and salt concentration 

20 chosen so that the washing temperature is about 5°C to 20°C below the Tm. The 
temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a 
labeled nucleic acid of interest and then washed under conditions of different stringencies. 
Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 

25 hybridizations. The washing temperatures can be used as described above to achieve 
selective stringency, as is known in the art. (Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York, 1989; Kunkel et al. Methods Enzymol 1987:154:367, 1987). A preferable stringent 
hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous 

30 solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
and washing, if desired, can be reduced accordingly as the degree of complementarity 
desired is decreased, and further, depending upon the G-C or A-T richness of any area 
wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
desired, can be increased accordingly as homology desired is increased, and further, 

35 depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 
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5 By the "suitability of an AAV vector for administration to a subjecf ' is meant a 

determination of whether the AAV vector will elicit a neutralizing immune response upon 
administration to a particular subject. A vector that does not elicit a significant immune 
response is a potentially suitable vector, whereas a vector that elicits a significant, 
neutralizing immune response (e.g. at least 90%) is thus likely to be unsuitable for use in 

10 that subject. Significance of any detectable immune response is a standard parameter 
understood by the skilled artisan in the field. For example, one can incubate the subject's 
serum with the virus, then determine whether that virus retains its ability to transduce cells 
in culture. If such virus cannot transduce cells in culture, the vector likely has elicited a 
significant immune response. 

1 5 Alternatively, or additionally, one skilled in the art could determine whether or not 

AAV administration would be suitable for a particular cell type of a subject. For example, 
the artisan could culture muscle cells in vitro and transduce the cells with AAV in the 
presence or absence of the subject's serum. If there is a reduction in transduction efficiency, 
this could indicate the presence of a neutralizing antibody or other factors that may inhibit 

20 transduction. Normally, greater than 90% inhibition would have to be observed in order to 
rule out the use of AAV-5 as a vector. However, this limitation could be overcome by 
treating the subject with an immunosuppressant that could block the factors inhibiting 
transduction. 

EXAMPLES 

25 The following examples are put forth so as to provide those of ordinary skill in the 

art with a complete disclosure and description of how the compounds, compositions, 
articles, devices and/or methods claimed herein are made and evaluated, and are intended to 
be purely exemplary of the invention and are not intended to limit the scope of what the 
inventors regard as their invention. Efforts have been made to ensure accuracy with respect 

30 to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be 

accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or 
is at ambient temperature, and pressure is at or near atmospheric. 

Example! 

Previous research had demonstrated that Caco-2 and MDCK cells are model cell 
35 lines for the study of macromolecular transport via transcytosis. Furthermore these cell lines 
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5 have been used to demonstrate transcytosis of both viruses and proteins. Therefore, to test if 
AAV can spread through tissue by transcytosis, 2xl0 8 DNA resistant particles of 
recombinant AAV2 (rAAV2) AAV4, AAV5, AAV6, BAAV suspended in 50ul of medium 
were placed in the upper (apical) side of the transwell polycarbonate filter over a monolayer 
of cells each of the following cells Caco-2, MDCKI, MDCKH, Human primary airways 

10 epithelia cells (Airway), Human primary immortalized epithelial endometrial, Bovine brain 
primary endothelia cells (BBB), or HeLa. All cultures had TERs indicating the formation of 
tight junctions and polarized phenotype. After 3 hours of incubation the medium in the basal 
side of the transwell was collected and tested for the presence of transcytosed rAAV DNA. 
Viral DNA was extracted from 200ul of basal medium and quantified by qPCR. 

1 5 In these cell lines, transcytosis was observed with several AAV serotypes and appeared 

to be both serotype and tissue-specific (Fig. 1). Three hours after the addition of AAV to the 
apical surface of the cells, over 800,000 particles of AAV5 were present in the media on the 
basal lateral side of the trans-well insert of CaCo-2 cells, but not the MDCK, airway epithelia, 
endometrial, or BBB cells (Fig. 1). Similarly BAAV particles were detected in the media on the 

20 basal lateral side of the MDCK, airways epithelia, endometrial, and BBB cells but not the Caco- 
2 cells. Interestingly, AAV4 was detected in the basal lateral media of all cell types. No virus 
was detected in the basal lateral media when AAV2 was added to the apical surface in either 
cell type. AAV6 did not transcytose in any of cell types tested, and was not tested on airway 
epithelia or BBB. HeLa cells do not form barrier epithelia and were used as a control. 

25 Example 2 

Previous work has demonstrated that transcytosis is a temperature dependent process 
than can be inhibited at 4°C. Transcytosis can also be inhibited by the addition of agents that 
selectively fix the plasma membrane. Recently the addition of tannic acid, a mild fixative 
agent, to the basal lateral surface blocked the transcytosis of GPI-anchored proteins to the 

30 apical surface (Polishchuk R, Nat Cell Biol 2004. 6(4):297-307). Therefore the ability of 
this agent to block the transcytosis of AAV was tested. Treatment of the basal lateral 
surface of either Caco-2 or MDCK cells prior to virus addition to the apical surface blocked 
the accumulation of AAV5 or BAAV in the basal lateral media. Furthermore, quantification 
of the intracellular virus demonstrated inhibition of exocytosis by tannic acid treatment 

35 dramatically increase the amount of AAV DNA in the cell suggesting the viral particles 
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5 detected in the basal lateral media are the result of an intracellular transport process and not 
aparacellularroute. 

Treatment of the basal lateral surface of Human primary airways epithelial cell 
(HAE) with tannic acid blocked the transcytosis of B AAV or AAV4 vector containing a 
GFP expression cassette from the apical surface to the basal lateral (Fig. 2). Furthermore 
10 transduction dramatically increased when assayed at 24 hrs post inoculation. Li contrast no 
change was observed in AAV2 transduction, which did not demonstrate any transcytosis 
activity and has limited binding activity on HAE. 

Example 3 

To confirm the DNA detected in the basal lateral media was indeed extracted from 

1 5 intact virus, the material was tested for DNase resistance after treatment with heat, ionic 
detergent or protease. The addition of DNase alone or in combination with the ionic 
detergent deoxycholine had no effect on the viral DNA present in the media suggesting it 
was not free DNA or complexed in lipid vesicles. However, heating to 95°C prior to 
treatment with DNAase completely degraded the viral DNA present in the media. This 

20 profile is identical to that of the input AAV particles and suggests the viral DNA is still 
encapsulated. Titration of the DNase resistant virus in the basal lateral media on Cos cells 
gave a similar particle to infectivity ratio to the input AAV particles. 

While it would appear the AAV DNA detected in the basal lateral media is 
contained in intact particles, its presence on the basal lateral surface could be the result of 

25 lyses of the cells or disruption of the monolayer. Therefore the TER was carefully monitored 
throughout the course of these experiments and was not observed to decrease. To further 
confirm the integrity of the cell monolayer, mixing experiments were studied in which two 
viruses with different gene cassettes were added to the apical surface at the same time and 
three hours post addition the amount of each virus in the basal lateral media was quantified 

30 using QPCR specific for each cassette. Both BAAV and AAV5 were able to pass from the 
apical to the basal lateral surface of MDCK or Caco cells respectively but the AAV2 did 
not Therefore the presence of viral particles in the basal lateral media does not appear to be 
the result of a disruption in the cell monolayer. 

Taken together this data suggest that dependoviruses particles are capable of passing 

35 through barrier epithelia via transcytosis and the process is both serotype and cell type 
specific. 
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5 Example 4 

To further characterize the transcytosis activity observed with AAV5 and BAAV, 
transcytosis was quantified as both a time and concentration dependent event. After the 
addition of particles to the apical surface, samples were removed from the basal lateral 
media at different time points and the amount of virus was quantified by QPCR of the 

10 extracted DNA. Viral genomes could be detected as soon as 30 minutes after addition and 
steadily increased with time By 24 hrs, over 1/3 of the input recombinant AAV5, 
BAAVvirus added to Caco or MDCK cells respectively had been transported to the basal 
lateral surface. In contrast, none of the input AAV2 or adenovirus was detected on the basal 
lateral side after 24 hrs. 

1 5 If transcytosis is an activity used by AAV to spread through tissue, this finding 

would help explain the lack of transduction of barrier epithelia reported with some isolates 
of AAV. Primary human bronchial airway epithelia (HAE) are known to transport albumin 
from the apical to the basal lateral surface by receptor-mediated transcytosis in vivo. While 
the interaction of BAAV with primary HAE has not been investigated, AAV4, 5 are 

20 reported to bind to HAE, however, for AAV4, this interaction does not result in 

transduction. Because of the interaction of AAV4 with O-link sialic acid„ it was proposed, 
and has been demonstrated, that mucins, which contained large amounts of O-linked sialic 
acid and are expressed on the apical surface of HAE, can block AAV4 transduction. 
Alternatively the lack of transduction could be the result of transcytosis of the virus through 

25 the tissue. 

To test this hypothesis, AAV2, 4, 5, BAAV were added to the apical surface of 
confluent monolayer cultures of primary human bronchial airway and transcytosis to the 
basal lateral surface was measured by QPCR after 3 hrs. All cultures had high TERs and 
expressed ciliated structures on their apical surface. Highly differentiated HAE cultures in 
30 contrast to immature cultures are resistant to transduction by adenoviral vectors due to a 
lack of integrin expression that is necessary for adenovirus entry. 

Of the 4 AAVs tested for transcytosis, AAV4 and BAAV were detected in the basal 
lateral media. No transport of AAV2 or AAVS was detected. As a control, adenovirus also 
was tested for transcytosis activity in the HAE cultures, but no transport was detected. 
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5 Example 5 

Epithelial cells that line the genitourinary tract form an important epithelial barrier 
layer and can transport proteins by transcytosis. AAV2, 4, 5 or BAAV were therefore tested 
to determine for the ability to penetrate this barrier epithelial layer by transcytosis. A well- 
characterized model of endometrial cells has been reported by Kyo et al. Following addition 
10 of the 4 AAVs to the apical surface, BAAV and AAV4 could be detected in the basal lateral 
media when assayed at 3hrs post inoculation (Fig. 1). 

Example 6 

Most AAVs were identified originally as contaminants of laboratory stocks of 
adenovirus, thus our understanding of their natural biology, cell tropism, and knowledge the 

15 cellular components required for virus entry is limited. For AAV5, in addition to N-linked 
sialic acid, the platelet derived growth factor (PDGF) receptors were identified as protein 
receptors for AAV5 pi Pasquale et al, Nat Med. 2003 Oct;9(10): 1306-12). This interaction 
was confirmed by modulation of PDGFR expression by transfection of expression plasmids, 
inhibitor treatment, or competition experiments with the extracellular domain of PDGFRcc. 

20 Likewise AAV5 transduction could be blocked with sialolactosamine conjugates kaludov et 
al2001. 

Previous research had demonstrated that transcytosis is actin dependent and occurs 
by a caviolin mediated pathway. Furthermore transcytosis can be blocked by treatment with 
tannic acid. Therefore to better characterize the transcytosis pathway utilized by AAVS in 
25 Caco cells the cells were treated with a panel of agents known to block either transcytosis in 
other systems or AAVS mediated transduction. It was noted that AAVS transcytosis could 
be inhibited by filipin and nocozodol as well as treatment with tannic acid. 

Caco cells, which actively transcytosis AAV5, are not reported to express PDGFR 
and are not transduced by AAV5. In agreement, competition experiments with sPDGFRa 
30 had little effect on AAV5 transcytosis. Furthermore, competition experiments with 200 
ug/ml sialolactosamine or 200 ug/ml heparin did not inhibited AAVS transcytosis. 

Both BSA and transferrin are reported to transcytosis through Caco cells via distinct 
receptor mediated pathways. However competition with either agent did not inhibit AAVS 
transcytosis suggesting the AAVS could use a distinct pathway. 
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5 In addition to confirming the intracellular nature of AAV5 transcytosis in Caco cells, the 
above experiments suggest that AAV5 transcytosis is occurring by a pathway independent 
of the one described for transduction. To confirm this Caco cells were stably transfected 
with PDGFRa and assayed for both transcytosis and transduction activity. Caco cells were 
not permissive for AAV5 transduction, however transduction dramatically increase 

1 0 following stable expression of PDGFRa. In contrast only a minor increase in transcytosis 
activity was detected in the Caco/PDGFRa cells. 

Throughout this application, various publications are referenced. The disclosures of 
these publications in their entireties are hereby incorporated by reference into this 
application in order to more fully describe the state of the art to which this invention 

15 pertains. 

It will be apparent to those skilled in the art that various modifications and variations 
can be made in the present invention without departing from the scope or spirit of the 
invention. Other embodiments of the invention will be apparent to those skilled in the art 
from consideration of the specification and practice of the invention disclosed herein. It is 
20 intended that the specification and examples be considered as exemplary only, with a true 
scope and spirit of the invention being indicated by the following claims. 
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CLAIMS 

What is claimed is: 

1 . A method of delivering a heterologous nucleic acid across an epithelial barrier 
comprising delivering to the epithelial barrier an AAV vector comprising the 
heterologous nucleic acid. 

2. The method of claim 1, wherein the epithelial cells are in the gut, lung, genitourinary 
tract, kidney, blood vessels or brain. 

3. The method of claim 1, wherein the epithelial cells can be selected from a group 
consisting of bronchial, alveolar, tracheal or upper airway epithelial cells; absorptive 
enterocytes; endometrial or urinary epithelial cells; renal collecting duct or proximal 
tubule epithelial cells; cerebral microvascular endothelial cells; or Choroidal Plexus 
epithelial cells . 

4. A method of transcytosing epithelial cells of a human subject comprising 
administering to the subject an AAV vector comprising a heterologous nucleic acid. 

5 . The method of claim 4, wherein the epithelial cells arp selected from a group 
consisting of bronchial, alveolar, tracheal or upper airway epithelial cells; absorptive 
enterocytes; endometrial or urinary epithelial cells; renal collecting duct or proximal 
tubule epithelial eels; cerebral microvascular endothelial cells; or Choroidal Plexus 
epithelial cells. 

6. A method of delivering a heterologous nucleic acid across human airway epithelial 
cells, comprising delivering to the cells a B AAV vector comprising the nucleic acid. 

7. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a BAAV vector 
comprising the nucleic acid. 

8. A method of delivering a heterologous nucleic acid across human endometrial 
epithelial cells, comprising delivering to the cells a BAAV vector comprising the 
nucleic acid. 

9. A method of delivering a heterologous nucleic acid across human kidney epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

10. A method of delivering a heterologous nucleic acid across human enterocytes, 
comprising delivering to the cells a AAV5 vector comprising the nucleic acid. 
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11. A method of delivering a heterologous nucleic acid across human airway epithelial 
cells, comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

12. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a AAV4 vector 
comprising the nucleic acid 

13. A method of delivering a heterologous nucleic acid across human endometrial 
epithelial cells, comprising delivering to the cells a AAV4 vector comprising the 
nucleic acid. 

14. A method of delivering a heterologous nucleic acid across human kidney epithelial 
cells, comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

15. A method of delivering a heterologous nucleic acid across human enterocytes 
comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

16. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a AAV7 vector 
comprising the nucleic acid. 

17. A method of delivering a heterologous nucleic acid across an epithelial barrier of the 
lung, comprising delivering to the lung a BAAV vector comprising the nucleic acid. 

18. The method of claiml7, wherein the epithelial barrier comprises human bronchial, 
alveolar, tracheal or upper airway epithelial cells. 

1 9. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
brain, comprising delivering to the brain a BAAV vector comprising the nucleic 
acid. 

20. The method of claiml 9, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

21. A method of delivering a heterologous nucleic acid across the epithelial barrier of 
blood vessels into the muscle, comprising delivering to the blood stream a BAAV 
vector comprising the nucleic acid. 

22. The method of claim 21, wherein the epithelial barrier comprises human vascular 
endothelial cells of the blood brain barrier. 
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23. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
genitourinary tract, comprising delivering to the genitourinary tract a BAAV vector 
comprising the nucleic acid genitourinary tract. 

24. The method of claim 23, wherein the epithelial barrier comprises human endometrial 
or urinary epithelial cells. 

25. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
kidney, comprising delivering to the genitourinary tract a BAAV vector comprising 
the nucleic acid genitourinary tract. 

26. The method of claim 25, wherein the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. 

27. A method of transcytosing lung epithelial cells of a subject comprising contacting 
the lung epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. 

28. The method of claim 27, wherein the epithelial cells are human bronchial, tracheal, 
or upper airway epithelial cells. 

29. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. 

30. The method of claim 29, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

31. A method of transcytosing vascular epithelial cells of a subject comprising 
contacting the vascular epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. 

32. The method of claim 3 1 , wherein the epithelial cells are human vascular endothelial 
cells of the blood brain barrier. 

33. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the genitourinary tract epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. 

34. The method of claim 33, wherein the epithelial cells are human endometrial or 
urinary tract epithelial cells. 
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35. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the kidney epithelial cells of the subject with a BAAV vector comprising 
a heterologous nucleic acid. 

36. The method of claim 35, wherein the epithelial cells are human renal collecting 
ducts or proximal tubules 

37. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
gut, comprising delivering to the gut an AAV5 vector comprising the nucleic acid. 

38. The method of claim 37, wherein the epithelial barrier comprises human absorptive 
enterocytes. 

39. A method of transcytosing gut epithelial cells of a subject comprising contacting the 
gut epithelial cells of the subject with an AAV5 vector comprising a heterologous 
nucleic acid. 

40. The method of claim 39, wherein the epithelial cells are human absorptive 
enterocytes. 

41 . A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
gut, comprising delivering to the gut an AAV4 vector comprising the nucleic acid. 

42. The method of claim 4 1 , wherein the epithelial barrier comprises human absorptive 
enterocytes. 

43. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
lung, comprising delivering to the lung an AAV4 vector comprising the nucleic acid. 

44. The method of claim 43, wherein the epithelial barrier comprises human bronchial, 
tracheal, or upper airway epithelial cells. 

45. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
CNS, comprising delivering to the CNS an AAV4 vector comprising the nucleic 
acid. 

46. The method of claim 45, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

47. A method of delivering a heterologous nucleic acid across the epithelial barrier of 
blood vessels into the muscle, comprising delivering to the blood stream an AAV4 
vector comprising the nucleic acid. 
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48. The method of claim 47, wherein the epithelial barrier comprises human vascular 
endothelial cells of the blood brain barrier. 

49. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
genitourinary tract, comprising delivering to the genitourinary tract an AAV4 vector 
comprising the nucleic acid. 

50. The method of claim 49, wherein the epithelial barrier comprises human endometrial 
or urinary epithelial cells. 

51. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
kidneys, comprising delivering to the kidneys an AAV4 vector comprising the 
nucleic acid. 

52. The method of claim 51, wherein the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. 

53. A method of transcytosing lung epithelial cells of a subject comprising contacting 
the lung epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

54. The method of 53, wherein the epithelial cells are human bronchial, tracheal, or 
upper airway epithelial cells. 

55. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

56. The method of claim 55, wherein the epithelial cells are human cerebral 

j microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

57. A method of transcytosing vascular epithelial cells of a subject comprising 
contacting the vascular epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. 

58. The method of claim 57, wherein the epithelial cells are vascular endothelial cells of 
the blood brain barrier. 

59. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the genitourinary epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. 
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60. The method of claim 59, wherein the epithelial cells are human endometrial or 
urinary epithelial cells. 

61 . A method of transcytosing kidney epithelial cells of a subject comprising contacting 
the kidney epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

62. The method of claim 6 1 , wherein the epithelial cells are human renal collecting 
ducts or proximal tubules 

63. A method of transcytosing gut epithelial cells of a subject comprising contacting the 
CNS epithelial cells of the subject with an AAV4 vector comprising a heterologous 
nucleic acid. 

64. The method of claim 63, wherein the epithelial cells are human absorptive 
enterocytes. 

65. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
brain, comprising delivering to the brain a AAV7 vector comprising the nucleic acid. 

66. The method of claim 65, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

67. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with a AAV7 vector comprising a 
heterologous nucleic acid. 

68. The method of claim 67, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 
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SEQUENCE LISTING 

<110> The Government of the united States of America, as represented by The 

Secretary, Department of Health and Human Services, National institutes of 
Health 

Chiorini, 3ohn A. 
DePasquale, Giovanni 

<120> TRANSCYTOSIS OF ADENO-ASSOCIATED VIRUSES 

<130> 14014/0427P1 

<140> TBA 

<141> 2005-09-08 

<150> 60/607,854 
<151> 2004-09-08 

<160> 72 

<170> FastSEQ for Windows version 4.0 

<210> 1 
<211> 4768 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<221> misc_feature 
<222> (0)... CO) 
<223> n=a,t,c, or g 

<221> variation 
<222> CO)... CO) 
<223> Xaa=any amino acid 



<400> 1 

ttggccactc 

agactgccgg 

gccaactcca 

gtccctgtat 

aagctgccac 

gagggtatat 

gagcagcagc 

agcacctgcc 

tgccgccgga 

aaaagctgca 

tcttctttgt 

ccgtgggcgt 

tgacccgcat 

cgcgtaatgg 

tgctccccaa 

gcgcctgttt 

cgcagacgca 

ggtcaaaaac 

cgtcagaaaa 

ccaactcgcg 

caaagacggc 



cctctatgcg 
cctctggccg 
tcatctaggt 
tagcagtcac 
gtcacagcca 
aaccgcgagt 
catgccgggg 
cggcatttct 
ttctgacatg 
acgcgagttc 
ccagttcgag 
caaatccatg 
ctaccgcggg 
cgccggaggc 
gacccagccc 
gaatctcgcg 
ggagcagaac 
ctccgccagg 
gcaatggatc 
gtcacaaatc 
tccggactac 



cgctcgctca 
gcagggccga 
ttgcccactg 
gtgagtgtcg 
cgtggtccgt 
gagccagcga 
ttctacgaga 
gactcttttg 
gacttgaatc 
ctggtcgagt 
aagggggaca 
gtggtgggcc 
gtcgagccgc 
gggaacaagg 
gagctccagt 
gagcgtaaac 
aaggaaaacc 
tacatggagc 
caggaggacc 
aaggccgcgc 
ctggtgggcc 



ctcactcggc 
gtgagtgagc 
acgtcaatgt 
tatttcgcgg 
ttgcgacagt 
ggagctccat 
tcgtgctgaa 
tgagctgggt 
tgattgagca 
ggcgccgcgt 
gctacttcca 
gctacgtgag 
agcttccgaa 
tggtggacga 
gggcgtggac 
ggctggtggc 
agaaccccaa 
tggtcgggtg 
aggcgtccta 
tggacaatgc 
agaacccgcc 



cctggagacc 
gagcgcgcat 
gacgtcctag 
agcgtagcgg 
ttgcgacacc 
tttgcccgcg 
ggtgcccagc 
ggccgagaag 
ggcacccctg 
gagtaaggcc 
cctgcacatc 
ccagattaaa 
ctggttcgcg 
ctgctacatc 
taacatggac 
gcagcatctg 
ttctgacgcg 
gctggtggac 
catctccttc 
ctccaaaatc 
ggaggacatt 



aaaggtctcc 
agagggagtg 
ggttagggag 
agcgcatacc 
atgtggtcag 
aattttgaac 
gacctggacg 
gaatgggagc 
accgtggccg 
ccggaggccc 
ctggtggaga 
gagaagctgg 
gtgaccaaga 
cccaactacc 
cagtatataa 
acgcacgtgt 
ccggtcatca 
cgcgggatca 
aacgccgcct 
atgagcctga 
tccagcaacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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gcatctaccg 
tgggctgggc 
cgacgggtaa 
tgaactggac 
gggaggaggg 
gcaaggtgcg 
tcgtcacctc 
accaacaacc 
actttggcaa 
tgaccgaggt 
ccaatgacgc 
cgtcagacgc 
acgtgggtat 
tggacatttg 
ctcaacccgt 
tcatggggag 
atgactgtga 
gattggctag 
gcccctaaac 
ggttacaaat 
gacgcggcag 
ccctacctca 
ccgtttgggg 
cttggtctgg 
tccccccagc 
aagaagctcg 
tccggagcca 
ggsggacaag 
tggtctgagg 
aaccacctnt 
accccctggg 
cagcgactca 
aacatccagg 
accagcacgg 
ggtcaagagg 
tactgtggac 
tgcctggagt 
agttttgaga 
atgaaccctc 
ctgaatgccg 
tttaaaaaga 
aatcaaaact 
agcactctgg 
cctgcggaca 
aacacggcca 
aacgccaccg 
ctgccgaccg 
agagacattt 
cacccctcac 
aagaacaccc 
ttcattactc 
gagcggtcca 
tctctgttgt 
cgctacctca 
gttgaacttt 
taagcagcgg 
tctggcaaac 
ggccctggag 
agcgagcgcg 

<210> 2 
<211> 623 
<212> PRT 



aatcctcgag 
gcaaaagaag 
aaccaacatc 
caatgagaac 
caagatgacg 
cgtggaccaa 
caacaccaac 
actccaggac 
ggtcaccaag 
gactcacgag 
agatataagt 
ggaagctccg 
gaatctgatg 
cttcacgcac 
gtctgtcgtc 
ggcgcccgag 
catggaacaa 
aggacaacct 
ccaaggcaaa 
acctcggacc 
ccctcgagca 
agtacaacca 
gcaacctcgg 
ttgagcaagc 
agcccgactc 
ttttcgaaga 
tgtctgatga 
gtgccgatgg 
gccacgtcac 
acaagcgact 
gatactttga 
tcaacaacaa 
tcaaggaggt 
ttcagatctt 
gcagcctgcc 
tggtgaccgg 
actttccttc 
aggtgccttt 
tcatcgacca 
ggactgccac 
actggctgcc 
acaagatccc 
acggaagatg 
gcaagttcag 
ccgtacccgg 
atacggacat 
tggacagact 
actaccaggg 
cgctgattgg 
cggtacctgc 
agtacagcac 
aacgctggaa 
gggctcccga 
cccaccacct 
ggtctccgtg 
cctgcggcgc 
catgatgatg 
accaaaggtc 
catagaggga 



atgaacgggt 
ttcgggaaga 
gcggaagcca 
tttccgttca 
gccaaggtcg 
aagtgcaagt 
atgtgcgcgg 
cggatgttca 
caggaagtca 
ttttacgtca 
gagcccaagc 
gtggactacg 
ctttttccct 
ggggtcatgg 
agaaagcgga 
gtggcctgct 
taaatgactc 
ctctgaaggc 
tcaacaacat 
cggcaacgga 
cgacaaggcc 
cgccgacgcg 
cagagcagtc 
gggtgagacg 
ctccacgggt 
cgaaactgga 
cagtgagatg 
agtgggtaat 
gaccaccagc 
cggagagagc 
cttcaaccgc 
ctggggcatg 
cacgacgtcg 
tgcggactcg 
tccttttccc 
caacacttcg 
gcagatgctg 
ccactcgatg 
gtacctgtgg 
caccaacttt 
cgggccttca 
tgccaccggg 
gagtgccctg 
caacagccag 
gactctgatc 
gtggggcaac 
gacagccttg 
tcccatttgg 
tgggtttggg 
gaatcctgca 
tggccaggtg 
ccccgaggtc 
tgcggctggg 
gtaataacct 
tccttcttat 
ttgcgcttcg 
gagttggcca 
tccagactgc 
gtggccaa 



acgatccgca 
ggaacaccat 
tcgcccacgc 
acgattgcgt 
tagagagcgc 
catcggccca 
tcatcgacgg 
agttcgagct 
aagacttttt 
gaaagggtgg 
gggcctgtcc 
cggacaggta 
gccggcaatg 
actgtgccga 
cgtatcagaa 
cggcctgcga 
aaaccagata 
gttcgagagt 
caggacaacg 
ctcgacaagg 
tacgaccagc 
gagttccagc 
ttccaggcca 
gctcctggaa 
atcggcaaaa 
gcaggcgacg 
cgtgcagcag 
gcctcgggtg 
accagaacct 
ctgcagtcca 
ttccactgcc 
cgacccaaag 
aacggcgaga 
tcgtacgaac 
aacgacgtct 
cagcaacaga 
cggactggca 
tacgcgcaca 
ggactgcaat 
accaagctgc 
atcaagcagc 
tcagacagtc 
acccccggac 
ctcatctttg 
ttcacctctg 
ctacctggcg 
ggagccgtgc 
gccaagattc 
ctgaaacacc 
acgaccttca 
tcggtgcaga 
cagtttacct 
aaatacactg 
gttaatcaat 
cttatctcgt 
cggtttacaa 
ctccctctat 
cggcctctgg 



gtacgcggcc 
ctggctcttt 
cgtgcccttc 
cgacaagatg 
caaggccatc 
gatcgaccca 
aaactcgacc 
caccaagcgc 
ccggtgggcg 
agctagaaag 
gtcagttgcg 
ccaaaacaaa 
cgagagaatg 
gtgcttcccc 
actgtgtccg 
actggccaat 
tgactgacgg 
ggtgggcgct 
ctcggggtct 
gggaacccgt 
agctcaaggc 
agcggcttca 
aaaagagggt 
agaagagacc 
aaggcaagca 
gaccccctga 
ctggcggagc 
attggcattg 
gggtcttgcc 
acacctacaa 
acttctcacc 
ccatgcgggt 
caacggtggc 
tgccgtacgt 
ttatggtgcc 
ctgacagaaa 
acaactttga 
gccagagcct 
cgaccaccac 
ggcctaccaa 
agggcttctc 
tcatcaaata 
ctccaatggc 
cggggcctaa 
aggaggagct 
gtgaccagag 
ctggaatggt 
ctcataccga 
cgcctcctca 
gctctactcc 
ttgactggga 
ccaactacgg 
agcctagggc 
aaaccggttt 
ttccatggct 
ctgccggtta 
gcgcgctcgc 
ccggcagggc 



tccgtcttcc 
gggccggcca 
tacggctgcg 
gtgatctggt 
ctgggcggaa 
actcccgtga 
accttcgagc 
ctggagcacg 
tcagatcacg 
aggcccgccc 
cagccatcga 
tgttctcgtc 
aatcagaatg 
gtgtcagaat 
attcatcaca 
gtggacttgg 
ttaccttcca 
gcaacctgga 
tgtgcttccg 
caacgcagcg 
cggtgacaac 
gggcgacaca 
tcttgaacct 
gttgattgaa 
gccggctaaa 
gggatcaact 
tgcagtcgag 
cgattccacc 
cacctacaac 
cggattctcc 
acgtgactgg 
caaaatcttc 
taataacctt 
gatggatgcg 
ccagtacggc 
tgccttctac 
aattacgtac 
ggaccggctg 
cggaaccacc 
cttttccaac 
aaagactgcc 
cgagacgcac 
cacggctgga 
acagaacggc 
ggcagccacc 
caacagcaac 
ctggcaaaac 
tggacacttt 
aatttttatc 
ggtaaactcc 
gatccagaag 
acagcaaaac 
tatcggtacc 
attcgtttca 
actgcgtaca 
atcagtaact 
tcactcactc 
cgagtgagtg 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4768 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 2 

Met Pro Gly Phe Tyr Glu lie val Leu Lys Val Pro Ser Asp Leu Asp 

1 5 10 15 

Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Ser Trp val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 

35 40 45 

Glu Gin Ala Pro Leu Thr val Ala Glu Lys Leu Gin Arg Glu Phe Leu 

50 55 60 

Val Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 . 80 

Gin Phe Glu Lys Gly Asp ser Tyr Phe His Leu His lie Leu val Glu 

85 " 90 95 

Thr val Gly Val Lys Ser Met val Val Gly Arg Tyr Val ser Gin He 

100 105 110 

Lys Glu Lys Leu Val Thr Arg lie Tyr Arg Gly Val Glu Pro Gin Leu 

115 120 125 

pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val val Asp Asp Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Asp Gin Tyr lie 

165 170 175 

ser Ala cys Leu Asn Leu Ala Glu Arg Lys Arg Leu val Ala Gin His 

180 185 ~ 190 

Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 

195 200 205 

Pro Asn Ser Asp Ala Pro Val lie Arg Ser Lys Thr ser Ala Arg Tyr 

210 215 220 

Met Glu Leu Val Gly Trp Leu val Asp Arg Gly lie Thr Ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie ser Phe Asn Ala Ala 

245 250 255 

Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala ser Lys 

260 265 270 

He Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu val Gly Gin Asn 

275 280 285 

pro Pro Glu Asp lie Ser ser Asn Arg lie Tyr Arg lie Leu Glu Met 

290 295 300 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

325 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala Val Pro 

340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

cys val Asp Lys Met Val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 , 380 

Lys val val Glu ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

val Asp Gin Lys cys Lys ser Ser Ala Gin lie Asp Pro Thr Pro Val 

405 410 415 

lie val Thr ser Asn Thr Asn Met Cys Ala val lie Asp Gly Asn Ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 
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Glu Leu Thr Lys Arg Leu Glu His Asp pne Giy Lys vai Thr Lys Gin 

450 " 455 460 

Glu val Lys Asp Phe Phe Arg Trp Ala Ser Asp His val Thr Glu Val 
465 470 475 480 

Thr His Glu Phe Tyr val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

485 490 495 

pro Asn Asp Ala Asp lie Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 

500 505 510 

Ala Gin Pro Ser Thr Ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

515 520 525 

Arq Tyr Gin Asn Lys Cys ser Arg His Val Gly Met Asn Leu Met Leu 

y 530 . . 535 540 

Phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin Asn val Asp He cys 
545 550 555 560 

Phe Thr His Gly val Met Asp Cys Ala Glu Cys Phe Pro Val ser Glu 

565 570 575 

ser Gin Pro val Ser Val Val Arg Lys Arg Thr Tyr Gin Lys Leu Cys 

580 585 590 

Pro lie His His lie Met Gly Arg Ala Pro Glu Val Ala cys Ser Ala 

595 600 605 

cys Glu Leu Ala Asn Val Asp Leu Asp Asp cys Asp Met Glu Gin 
610 615 620 

<210> 3 
<211> 2495 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 3 , 

Ala Thr Gly cys Cys Gly Gly Gly Gly Thr Thr Cys Thr Ala Cys Gly 

1 5 10 15 

Ala Gly Ala Thr cys Gly Thr Gly cys Thr Gly Ala Ala Gly Gly Thr 

20 25 30 

Gly cys Cys Cys Ala Gly cys Gly Ala Cys Cys Thr Gly Gly Ala Cys 

35 40 45 

Met Pro Gly Phe Tyr Glu lie Val Leu Lys Val Pro ser Asp Leu Asp 

50 55 60 

Gly Ala Gly Cys Ala Cys Cys Thr Gly Cys Cys Cys Gly Gly cys Ala 
65 70 75 . . 8 ? 

Thr Thr Thr cys Thr Gly Ala Cys Thr Cys Thr Thr Thr Thr Gly Thr 

85 90 95 

Gly Ala Gly Cys Thr Gly Gly Gly Thr Gly Gly cys Cys Gly Ala Gly 

100 105 110 

Glu His Leu Pro Gly lie ser Asp ser Phe val ser Trp val Ala Glu 

115 120 n 125 

Ala Ala Gly Gly Ala Ala Thr Gly Gly Gly Ala Gly cys Thr Gly Cys 

130 135 140 

cys Gly cys Cys Gly Gly Ala Thr Thr Cys Thr Gly Ala cys Ala Thr 
145 150 155 160 

Gly Gly Ala cys Thr Thr Gly Ala Ala Thr cys Thr Gly Ala Thr Thr 

165 170 175 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 

180 185 190 

Gly Ala Gly Cys Ala Gly Gly cys Ala Cys Cys Cys Cys Thr Gly Ala 

195 * 200 205 

Cys cys Gly Thr Gly Gly cys Cys Gly Ala Ala Ala Ala Gly cys Thr 

210 215 220 

Gly cys Ala Ala cys Gly cys Gly Ala Gly Thr Thr cys cys Thr Gly 
225 230 235 240 
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Glu Gin Ala Pro Leu Thr val Ala Glu Lys Leu i n Arg Glu Phe Leu 

245 250 255 

Gly Thr cys Gly Ala Gly Thr Gly Gly Cys Gly cys cys Gly cys Gly 

260 265 270 

Thr Gly Ala Gly Thr Ala Ala Gly Gly cys Cys cys cys Gly Gly Ala 

275 280 285 

Gly Gly cys cys cys Thr Cys Thr Thr Cys Thr Thr Thr Gly Thr cys 

290 2§5 300 

val Glu Trp Arg Arg val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
305 310 315 320 

cys Ala Gly Thr Thr cys Gly Ala Gly Ala Ala Gly Gly Gly Gly Gly 

325 330 335 

Ala cys Ala Gly cys Thr Ala Cys Thr Thr Cys cys Ala cys Cys Thr 

340 345 350 

Gly cys Ala cys Ala Thr Cys cys Thr Gly Gly Thr Gly Gly Ala Gly 

355 360 .365 

Gin Phe Glu Lys Gly Asp Ser Tyr Phe His Leu His lie Leu val Glu 

370 375 380 

Ala cys cys Gly Thr Gly Gly Gly cys Gly Thr Cys Ala Ala Ala Thr 
385 390 395 400 

cys cys Ala Thr Gly Gly Thr Gly Gly Thr Gly Gly Gly Cys Cys Gly 

405 410 415 

Cys Thr Ala Cys Gly Thr Gly Ala Gly cys cys Ala Gly Ala Thr Thr 

420 425 430 

Thr val Gly val Lys ser Met val val Gly Arg Tyr val ser Gin lie 

435 440 445 

Ala Ala Ala Gly Ala Gly Ala Ala Gly cys Thr Gly Gly Thr Gly Ala 

450 455 460 

cys cys cys Gly cys Ala Thr Cys Thr Ala cys cys Gly cys Gly Gly 
465 470 475 480 

Gly Gly Thr Cys Gly Ala Gly Cys Cys Gly Cys Ala Gly Cys Thr Thr 

485 490 495 

Lys Glu Lys Leu Val Thr Arg lie Tyr Arg Gly Val Glu Pro Gin Leu 

500 505 510 

cys cys Gly Ala Ala Cys Thr Gly Gly Thr Thr cys Gly Cys Gly Gly 

515 520 525 

Thr Gly Ala cys Cys Ala Ala Gly Ala Cys Gly Cys Gly Thr Ala Ala 

530 535 540 • 

Thr Gly Gly Cys Gly Cys Cys Gly Gly Ala Gly Gly Cys Gly Gly Gly 
545 550 l 555 560 

pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

565 570 575 

Ala Ala cys Ala Ala Gly Gly Thr Gly Gly Thr Gly Gly Ala cys Gly 

580 585 590 

Ala cys Thr Gly Cys Thr Ala cys Ala Thr cys Cys Cys cys Ala Ala 

595 600 605 

cys Thr Ala cys cys Thr Gly Cys Thr cys cys cys cys Ala Ala Gly 

610 615 620 

Asn Lys val Val Asp Asp cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
625 630 635 640 

Ala cys cys cys Ala Gly cys Cys Cys Gly Ala Gly Cys Thr Cys cys 

645 650 , 655 

Ala Gly Thr Gly Gly Gly Cys Gly Thr Gly Gly Ala Cys Thr Ala Ala 

660 665 670 

Cys Ala Thr Gly Gly Ala Cys Cys Ala Gly Thr Ala Thr Ala Thr Ala 

675 680 685 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Asp Gin Tyr lie 

690 695 700 

Ala Gly cys Gly Cys Cys Thr Gly Thr Thr Thr Gly Ala Ala Thr Cys 
705 710 715 720 

Thr cys Gly cys Gly Gly Ala Gly Cys Gly Thr Ala Ala Ala cys Gly 

725 730 735 

Gly cys Thr Gly Gly Thr Gly Gly cys Gly Cys Ala Gly cys Ala Thr 
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740 745 , 750 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 

755 760 , 765 

cys Thr Gly Ala cys Gly Cys Ala cys Gly Thr Gly Thr Cys Gly Cys 

770 775 780 

Ala Gly Ala cys Gly Cys Ala Gly Gly Ala Gly cys Ala Gly Ala Ala 
785 790 795 , 800 

cys Ala Ala Gly Gly Ala Ala Ala Ala cys Cys Ala Gly Ala Ala Cys 

805 810 815 

Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 

820 825 830 

cys cys Cys Ala Ala Thr Thr Cys Thr Gly Ala cys Gly Cys Gly cys 

835 840 845 

cys Gly Gly Thr cys Ala Thr Cys Ala Gly Gly Thr cys Ala Ala Ala 

850 855 860 

Ala Ala Cys cys Thr cys cys Gly cys cys Ala Gly Gly Thr Ala Cys 
865 870 875 880 

pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr ser Ala Arg Tyr 

885 890 895 

Ala Thr Gly Gly Ala Gly cys Thr Gly Gly Thr Cys Gly Gly Gly Thr 

900 905 910 

Gly Gly cys Thr Gly Gly Thr Gly Gly Ala cys Cys Gly Cys Gly Gly 

915 920 925 

Gly Ala Thr Cys Ala Cys Gly Thr Cys Ala Gly Ala Ala Ala Ala Gly 

930 935 940 

Met Glu Leu val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 
945 950 955 960 

cys Ala Ala Thr Gly Gly Ala Thr cys Cys Ala Gly Gly Ala Gly Gly 

965 970 975 

Ala Cys Cys Ala Gly Gly cys Gly Thr Cys cys Thr Ala Cys Ala Thr 

980 985 990 

Cys Thr cys cys Thr Thr cys Ala Ala cys Gly Cys Cys Gly Cys cys 

995 1000 1005 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie ser Phe Asn Ala Ala 

1010 1015 1020 

Thr cys cys Ala Ala Cys Thr Cys Gly cys Gly Gly Thr Cys Ala cys 
1025 1030 1035 1040 

Ala Ala Ala Thr Cys Ala Ala Gly Gly Cys Cys Gly Cys Gly Cys Thr 

1045 1050 1055 

Gly Gly Ala cys Ala Ala Thr Gly cys cys Thr cys cys Ala Ala Ala 

1060 1065 1070 

Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Ser Lys 

1075 1080 1085 

Ala Thr Cys Ala Thr Gly Ala Gly cys cys Thr Gly Ala cys Ala Ala 

1090 1095 1100 

Ala Gly Ala cys Gly Gly Cys Thr cys cys Gly Gly Ala cys Thr Ala 
1105 1110 1115 n 1120 

Cys Cys Thr Gly Gly Thr Gly Gly Gly Cys^Cys Ala Gly Ala ^Ta^Cys 

lie Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu val Gly Gin Asn 

1140 1145 1150 

Cys cys Gly cys Cys Gly Gly Ala Gly Gly Ala cys Ala Thr Thr Thr 

1155 1160 , 1165 

cys cys Ala Gly Cys Ala Ala cys cys Gly cys Ala Thr cys Thr Ala 

1170 1175 1180 

Cys cys Gly Ala Ala Thr cys cys Thr Cys Gly Ala Gly Ala Thr Gly 
1185 1190 1195 1200 

Pro Pro Glu Asp lie Ser Ser Asn Arg lie Tyr Arg lie Leu Glu Met 

1205 1210 1215 

Ala Ala Cys Gly Gly Gly Thr Ala cys Gly Ala Thr cys cys Gly Cys 

1220 1225 1230 

Ala Gly Thr Ala cys Gly cys Gly Gly Cys cys Thr Cys cys Gly Thr 
1235 1240 1245 
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cvs Thr Thr cys Cys Thr Gly Gly Gly Cys Thr Gly Gly Gly cys Gly 

1250 1255 1260 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala ser val Phe Leu Gly Trp Ala 
1265 1270 1275 , 1280 

Cys Ala Ala Ala Ala Gly Ala Ala Gly Thr Thr Cys Gly Gly Gly Ala 

y 1285 1290 1295 

Ala Gly Ala Gly Gly Ala Ala Cys Ala Cys cys Ala Thr cys Thr Gly 

1300 1305 1310 

Gly cys Thr cys Thr Thr Thr Gly Gly Gly cys cys Gly Gly cys cys 

1315 1320 1325 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

1330 1335 1340 

Ala cys Gly Ala Cys Gly Gly Gly Thr Ala Ala Ala Ala cys Cys Ala 
1345 1350 1355 1360 

Ala cys Ala Thr Cys Gly Cys Gly Gly Ala Ala Gly cys cys Ala Thr 

1365 1370 1375 

cys Gly Cys cys cys Ala cys Gly ^s^cys Gly Thr Gly ^| Q Cy s c y s 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala val Pro 

1395 1400 1405 

Thr Thr cys Thr Ala cys Gly Gly cys Thr Gly cys Gly Thr Gly Ala 

1410 1415 1420 

Ala Cys Thr Gly Gly Ala cys Cys Ala Ala Thr Gly Ala Gly Ala Ala 
1425 1430 1435 1440 

Cys Thr Thr Thr Cys Cys Gly Thr Thr Cys Ala Ala Cys Gly Ala Thr 

1445 1450 1455 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

1460 1465 1470 

Thr Gly cys Gly Thr cys Gly Ala Cys Ala Ala Gly Ala Thr Gly Gly 

1475 1480 1485 

Thr Gly Ala Thr cys Thr Gly Gly Thr Gly Gly Gly Ala Gly Gly Ala 

1490 1495 1500 

Gly Gly Gly cys Ala Ala Gly Ala Thr Gly Ala cys Gly Gly Cys cys 
1505 1510 1515 1520 

cys Val Asp Lys Met val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

1525 1530 , 1535 

Ala Ala Gly Gly Thr cys Gly Thr Ala Gly Ala Gly Ala Gly cys Gly 

1540 1545 1550 

cys Cys Ala Ala Gly Gly Cys Cys Ala Thr cys Cys Thr Gly Gly Gly 

1555 1560 1565 

cys Gly Gly Ala Ala Gly cys Ala Ala Gly Gly Thr Gly cys Gly cys 

1570 1575 1580 

Lys Val val Glu ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg 
1585 1590 1595 1600 

Gly Thr Gly Gly Ala cys cys Ala Ala Ala Ala Gly Thr Gly Cys Ala 

1605 1610 1615 

Ala Gly Thr cys Ala Thr Cys Gly Gly cys Cys Cys Ala Gly Ala Thr 

1620 1625 1630 

cys Gly Ala Cys cys Cys Ala Ala cys Thr Cys cys Cys Gly Thr Gly 

1635 1640 1645 

val Asp Gin Lys Cys Lys ser ser Ala Gin He Asp Pro Thr Pro val 

1650 1655 1660 

Ala Thr cys Gly Thr cys Ala cys cys Thr cys cys Ala Ala Cys Ala 
1665 1670 1675 1680 

Cys cys Ala Ala Cys Ala Thr Gly Thr Gly Cys Gly cys Gly Gly Thr 

1685 1690 1695 

Cys Ala Thr cys Gly Ala cys Gly Gly Ala Ala Ala Cys Thr cys Gly 

1700 1705 1710 

He val Thr ser Asn Thr Asn Met cys Ala val lie Asp Gly Asn Ser 

1715 1720 1725 

Ala cys cys Ala cys Cys Thr Thr cys Gly Ala Gly cys Ala cys cys 

1730 1735 1740 

Ala Ala cys Ala Ala cys cys Ala cys Thr cys cys Ala Gly Gly Ala 
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1745 1750 1755 1760 

cys cys Gly Gly Ala Thr Gly Thr Thr c ^ o Ala Ala Gly Thr ^hr^cys 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

1780 1785 1790 

Glv Ala Gly cys Thr cys Ala Cys cys Ala Ala Gly cys Gly Cys cys 

1795 1800 1805 

Thr Gly Gly Ala Gly Cys Ala Cys Gly Ala Cys Thr Thr Thr Gly Gly 

1810 1815 1820 

cys Ala Ala Gly Gly Thr cys Ala cys Cys Ala Ala Gly cys Ala Gly 
1825 1830 1835 1840 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

1845 1850 1855 

Gly Ala Ala Gly Thr Cys Ala Ala Ala Gly Ala Cys Thr Thr Thr Thr 

1860 1865 1870 

Thr cys cys Gly Gly Thr Gly Gly Gly cys Gly Thr cys Ala Gly Ala 

1875 1880 , , 1885 

Thr cys Ala cys Gly Thr Gly Ala cys cys Gly Ala Gly Gly Thr Gly 

1890 1895 1900 _ 

Glu val Lys Asp Phe Phe Arg Trp Ala ser Asp His Val Thr Glu val 
1905 1910 1915 1920 

Ala cys Thr Cys Ala cys Gly Ala Gly Thr Thr Thr Thr Ala cys Gly 

1925 1930 1935 

Thr cys Ala Gly Ala Ala Ala Gly Gly Gly Thr Gly Gly Ala Gly Cys 

1940 1945 1950 

Thr Ala Gly Ala Ala Ala Gly Ala Gly Gly cys cys Cys Gly Cys Cys 

1955 I960 1965 

Thr His Glu Phe Tyr val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

1970 1975 1?80 

Cys Cys Cys Ala Ala Thr Gly Ala Cys Gly Cys Ala Gly Ala Thr Ala 
1985 1990 1995 2000 

Thr Ala Ala Gly Thr Gly Ala Gly cys cys cys Ala Ala Gly cys Gly 

2005 2010 2015 

Gly Gly Cys cys Thr Gly Thr Cys Cys Gly Thr Cys Ala Gly Thr Thr 

2020 2025 2030 

Pro Asn Asp Ala Asp lie ser Glu Pro Lys Arg Ala Cys Pro Ser Val 

2035 2040 2045 

Gly Cys Gly Cys Ala Gly Cys Cys Ala Thr Cys Gly Ala cys Gly Thr 

2050 2055 2060 

cys Ala Gly Ala Cys Gly cys Gly Gly Ala Ala Gly Cys Thr cys cys 
2065 2070 2075 , , 2080 

Gly Gly Thr Gly Gly Ala cys Thr Ala Cys Gly cys Gly Gly Ala cys 

2085 2090 2095 

Ala Gin Pro Ser Thr ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

2100 2105 2110 

Ala Gly Gly Thr Ala cys cys Ala Ala Ala Ala Cys Ala Ala Ala Thr 

2115 2120 2125 

Gly Thr Thr cys Thr Cys Gly Thr cys Ala cys Gly Thr Gly Gly Gly 

2130 2135 2140 

Thr Ala Thr Gly Ala Ala Thr cys Thr Gly Ala Thr Gly cys Thr Thr 
2145 2150 2155 2160 

Arg Tyr Gin Asn Lys cys Ser Arg His val Gly Met Asn Leu Met Leu 

2165 2170 2175 

Thr Thr Thr cys Cys cys Thr Gly cys cys Gly Gly cys Ala Ala Thr 

2180 2185 2190 

Gly cys Gly Ala Gly Ala Gly Ala Ala Thr Gly Ala Ala Thr cys Ala 

2195 2200 2205 

Gly Ala Ala Thr Gly Thr Gly Gly Ala Cys Ala Thr Thr Thr Gly Cys 

2210 2215 , 2220 

Phe pro Cys Arg Gin cys Glu Arg Met Asn Gin Asn val Asp lie cys 
2225 2230 2235 2240 

Thr Thr cys Ala cys Gly cys Ala cys Gly Gly Gly Gly Thr cys Ala 
2245 2250 2255 
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Thr Gly Gly Ala Cys Thr Gly Thr Gly Cys cys Giy Ala Gly Thr Gly 

2260 2265 , 2270 

cys Thr Thr Cys cys cys Cys Gly Thr Gly Thr cys Ala Gly Ala Ala 

2275 2280 2285 

Phe Thr His Gly val Met Asp cys Ala Glu cys Phe Pro val Ser Glu 

2290 2295 2300 

Thr cys Thr Cys Ala Ala Cys cys Cys Gly Thr Gly Thr Cys Thr Gly 
2305 2310 2315 2320 

Thr Cys Gly Thr Cys Ala Gly Ala Ala Ala Gly cys Gly Gly Ala Cys 

2325 2330 2335 

Gly Thr Ala Thr Cys Ala Gly Ala Ala Ala cys Thr Gly Thr Gly Thr 

2340 2345 23 50 

ser Gin Pro val ser val Val Arg Lys Arg Thr Tyr Gin Lys Leu Cys 

2355 2360 2365 

cys Cys Gly Ala Thr Thr Cys Ala Thr cys Ala cys Ala Thr Cys Ala 

2370 2375 ^ , 2380 

Thr Gly Gly Gly Gly Ala Gly Gly Gly cys Gly cys cys cys Gly Ala 
2385 2390 2395 2400 

Gly Gly Thr Gly Gly Cys cys Thr Gly Cys Thr Cys Gly Gly cys Cys 

2405 2410 2415 

pro lie His His He Met Gly Arg Ala Pro Glu val Ala cys ser Ala 

2420 2425 2430 

Thr Gly cys Gly Ala Ala Cys Thr Gly Gly cys cys Ala Ala Thr Gly 

2435 2440 2445 

Thr Gly Gly Ala cys Thr Thr Gly Gly Ala Thr Gly Ala cys Thr Gly 

2450 2455 2460 

Thr Gly Ala cys Ala Thr Gly Gly Ala Ala cys Ala Ala Thr Ala Ala 
2465 2470 2475 2480 

Cys Glu Leu Ala Asn Val Asp Leu Asp Asp Cys Asp Met Glu Gin 
2485 2490 2495 

<210> 4 
<211> 734 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 4 , _ 

Met Thr Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu ser Glu 
15 10 15 

Gly Val Arg Glu Trp Trp Ala Leu Gin Pro Gly Ala Pro Lys Pro Lys 

20 25 30 

Ala Asn Gin Gin His Gin Asp Asn Ala Arg Gly Leu val Leu Pro Gly 

35 40 45 

Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro val 

50 55 60 

Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin 
65 70 75 , 80 

Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 

85 90 95 

Ala Glu Phe Gin Gin Arg Leu Gin Gly Asp Thr Ser Phe Gly Gly Asn 

100 105 110 

Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg val Leu Glu Pro Leu 

115 120 125 

Gly Leu val Glu Gin Ala Gly Glu Thr Ala Pro Gly Lys Lys Arg Pro 

130 135 140 

Leu He Glu ser Pro Gin Gin Pro Asp ser Ser Thr Gly He Gly Lys 
145 150 155 160 

Lys Gly Lys Gin Pro Ala Lys Lys Lys Leu val Phe Glu Asp Glu Thr 
165 170 175 
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Gly Ala Gly Asp Gly Pro Pro. Glu Gly Ser Thr Ser Gly Ala Met ser 

180 185 190 

Asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala Val Glu Gly 

195 200 205 

Gly Gin Gly Ala Asp Gly val Gly Asn Ala ser Gly Asp Trp ms cys 

210 215 220 

Asd ser Thr Trp Ser Glu Gly His Val Thr Thr Thr ser Thr Arg Thr 
225 230 235 240 

Trp val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu Gly Glu 

245 250 255 

Ser Leu Gin Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 

260 265 270 

Phe Asp Phe Asn Arg Phe His Cys His Phe ser Pro Arg Asp Trp Gin 

275 280 285 

Arq Leu lie Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met Arg Val 

290 295 300 

Lys lie Phe Asn lie Gin val Lys Glu val Thr Thr ser Asn Gly Glu 
305 310 315 320 

Thr Thr val Ala Asn Asn Leu Thr Ser Thr Val Gin lie Phe Ala Asp 

325 330 335 

ser Ser Tyr Glu Leu Pro Tyr val Met Asp Ala Gly Gin Glu Gly Ser 

340 345 n 350 

Leu Pro Pro Phe Pro Asn Asp Val Phe Met val Pro Gin Tyr Gly Tyr 

355 360 365 

cys Gly Leu val Thr Gly Asn Thr Ser Gin Gin Gin Thr Asp Arg Asn 

370 375 380 

Ala Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly 
385 390 395 400 

Asn Asn Phe Glu He Thr Tyr ser Phe Glu Lys Val Pro Phe His ser 

405 410 415 

Met Tyr Ala His ser Gin ser Leu Asp Arg Leu Met Asn Pro Leu He 

420 425 430 

Asp Gin Tyr Leu Trp Gly Leu Gin Ser Thr Thr Thr Gly Thr Thr Leu 

435 440 445 

Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro Thr Asn 

450 455 460 

Phe Ser Asn Phe Lys Lys Asn Trp Leu Pro Gly Pro Ser lie Lys Gin 
465 470 475 480 

Gin Gly Phe Ser Lys Thr Ala Asn Gin Asn Tyr Lys lie Pro Ala Thr 

485 490 495 

Gly ser Asp ser Leu He Lys Tyr Glu Thr His Ser Thr Leu Asp Gly 

500 505 510 

Arg Trp ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala Gly Pro 

515 520 525 

Ala Asp Ser Lys Phe Ser Asn ser Gin Leu lie Phe Ala Gly Pro Lys 

530 535 540 

Gin Asn Gly Asn Thr Ala Thr val Pro Gly Thr Leu He Phe Thr ser 
545 550 555 560 

Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met Trp Gly 

565 570 575 

Asn Leu Pro Gly Gly Asp Gin Ser Asn ser Asn Leu Pro Thr val Asp 

580 585 590 

Arg Leu Thr Ala Leu Gly Ala Val Pro Gly Met val Trp Gin Asn Arg 

595 600 605 a 

Asp lie Tyr Tyr Gin Gly Pro He Trp Ala Lys lie Pro His Thr Asp 

610 615 620 

Gly His Phe His Pro Ser Pro Leu lie Gly Gly Phe Gly Leu Lys His 
625 630 635 n 640 

Pro Pro Pro Gin lie Phe lie Lys Asn Thr Pro val Pro Ala Asn Pro 

645 650 . 655 

Ala Thr Thr Phe Ser Ser Thr Pro val Asn ser Phe lie Thr Gin Tyr 

660 665 670 

Ser Thr Gly Gin Val ser Val Gin lie Asp Trp Glu lie Gin Lys Glu 
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675 680 685 

Ara ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly 

690 695 700 

Gin Gin Asn ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys Tyr Thr 
705 710 715 

Glu Pro Arg Ala lie Gly Thr Arg Tyr Leu Thr His His Leu 
725 730 



<210> 5 
<211> 2208 
<212> DNA 

<213> Artificial Sequence 

< 220> . -. 

<223> Description of Artificial Sequence; note 

synthetic construct 

<221> misc_feature 
<222> (0) . . . (0) 
<223> n=a,t,c, or g 

<221> variation 
<222> CO)... CO) 
<223> Xaa = any amino acid 



<400> 5 

atgactgacg 

tggtgggcgc 

gctcggggtc 

ggggaacccg 

cagctcaagg 

cagcggcttc 

aaaaagaggg 

aagaagagac 

aaaggcaagc 

ggaccccctg 

gctggcggag 

gattggcatt 

tgggtcttgc 

aacacctaca 

cacttctcac 

gccatgcggg 

acaacggtgg 

ctgccgtacg 

tttatggtgc 

actgacagaa 

aacaactttg 

agccagagcc 

tcgaccacca 

cggcctacca 

cagggcttct 

ctcatcaaat 

cctccaatgg 

gcggggccta 

gaggaggagc 

ggtgaccaga 

cctggaatgg 

cctcataccg 

ccgcctcctc 

agctctactc 

attgactggg 

tccaactacg 

gagcctaggg 



gttaccttcc 
tgcaacctgg 
ttgtgcttcc 
tcaacgcagc 
ccggtgacaa 
agggcgacac 
ttcttgaacc 
cgttgattga 
agccggctaa 
agggatcaac 
ctgcagtcga 
gcgattccac 
ccacctacaa 
acggattctc 
cacgtgactg 
tcaaaatctt 
ctaataacct 
tgatggatgc 
cccagtacgg 
atgccttcta 
aaattacgta 
tggaccggct 
ccggaaccac 
acttttccaa 
caaagactgc 
acgagacgca 
ccacggctgg 
aacagaacgg 
tggcagccac 
gcaacagcaa 
tctggcaaaa 
atggacactt 
aaatttttat 
cggtaaactc 
agatccagaa 
gacagcaaaa 
ctatcggtac 



agattggcta 
agcccctaaa 
gggttacaaa 
ggacgcggca 
cccctacctc 
atcgtttggg 
tcttggtctg 
atccccccag 
aaagaagctc 
ttccggagcc 
gggsggacaa 
ctggtctgag 
caaccacctn 
caccccctgg 
gcagcgactc 
caacatccag 
taccagcacg 
gggtcaagag 
ctactgtgga 
ctgcctggag 
cagttttgag 
gatgaaccct 
cctgaatgcc 
ctttaaaaag 
caatcaaaac 
cagcactctg 
acctgcggac 
caacacggcc 
caacgccacc 
cctgccgacc 
cagagacatt 
tcacccctca 
caagaacacc 
cttcattact 
ggagcggtcc 
ctctctgttg 
ccgctacctc 



gaggacaacc 
cccaaggcaa 
tacctcggac 
gccctcgagc 
aagtacaacc 
ggcaacctcg 
gttgagcaag 
cagcccgact 
gttttcgaag 
atgtctgatg 
ggtgccgatg 
ggccacgtca 
tacaagcgac 
ggatactttg 
atcaacaaca 
gtcaaggagg 
gttcagatct 
ggcagcctgc 
ctggtgaccg 
tactttcctt 
aaggtgcctt 
ctcatcgacc 
gggactgcca 
aactggctgc 
tacaagatcc 
gacggaagat 
agcaagttca 
accgtacccg 
gatacggaca 
gtggacagac 
tactaccagg 
ccgctgattg 
ccggtacctg 
cagtacagca 
aaacgctgga 
tgggctcccg 
acccaccacc 



tctctgaagg 
atcaacaaca 
ccggcaacgg 
acgacaaggc 
acgccgacgc 
gcagagcagt 
cgggtgagac 
cctccacggg 
acgaaactgg 
acagtgagat 
gagtgggtaa 
cgaccaccag 
tcggagagag 
acttcaaccg 
actggggcat 
tcacgacgtc 
ttgcggactc 
ctccttttcc 
gcaacacttc 
cgcagatgct 
tccactcgat 
agtacctgtg 
ccaccaactt 
ccgggccttc 
ctgccaccgg 
ggagtgccct 
gcaacagcca 
ggactctgat 
tgtggggcaa 
tgacagcctt 
gtcccatttg 
gtgggtttgg 
cgaatcctgc 
ctggccaggt 
accccgaggt 
atgcggctgg 
tgtaataa 



cgttcgagag 
tcaggacaac 
actcgacaag 
ctacgaccag 
ggagttccag 
cttccaggcc 
ggctcctgga 
tatcggcaaa 
agcaggcgac 
gcgtgcagca 
tgcctcgggt 
caccagaacc 
cctgcagtcc 
cttccactgc 
gcgacccaaa 
gaacggcgag 
gtcgtacgaa 
caacgacgtc 
gcagcaacag 
gcggactggc 
gtacgcgcac 
gggactgcaa 
taccaagctg 
aatcaagcag 
gtcagacagt 
gacccccgga 
gctcatcttt 
cttcacctct 
cctacctggc 
gggagccgtg 
ggccaagatt 
gctgaaacac 
aacgaccttc 
gtcggtgcag 
ccagtttacc 
gaaatacact 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2208 
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<210> 6 
<211> 125 
<212> DNA 

<213> Artificial sequence 

<220> ^ ljC . . 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 6 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctggagacc aaaggtctcc 



60 
20 
125 



agactgccgg cctctggccg gcagggccga gtgagtgagc gagcgcgcat agagggagtg 120 
gccaa 

<210> 7 
<211> 245 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 7 cn 

ctccatcatc taggtttgcc cactgacgtc aatgtgacgt cctagggtta gggaggtccc 60 

tgtattagca gtcacgtgag tgtcgtattt cgcggagcgt agcggagcgc ataccaagct 120 

gccacgtcac agccacgtgg tccgtttgcg acagtttgcg acaccatgtg gtcaggaggg 180 

tatataaccg cgagtgagcc agcgaggagc tccattttgc ccgcgaattt tgaacgagca 240 

gcagc l ^ 

<210> 8 

<211> 313 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 8 

Met Glu Leu val Gly Trp Leu Val Asp Arg Gly He Thr Ser Glu Lys 

1 5 10 15 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 

20 25 30 

Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala ser Lys 

35 40 45 

He Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Asn 

50 55 60 

Pro pro Glu Asp lie ser ser Asn Arg He Tyr Arg He Leu Glu Met 
65 70 75 80 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser val Phe Leu Gly Trp Ala 

85 90 95 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

100 105 110 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala Val pro 

115 120 125 

Phe Tyr Gly cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

Cys val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys val val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys val Arg 
165 170 175 
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val Asp Gin Lys Cys Lys ser Ser Ala Gin lie Asp pro Thr Pro Val 

180 185 190 

lie val Thr Ser Asn Thr Asn Met cys Ala Val lie Asp Gly Asn ser 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

210 215 220 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 

225 230 235 _ _ 24 ° 

Glu val Lys Asp Phe Phe Arg Trp Ala Ser Asp His val Thr Glu Val 

Thr His Glu Phe Tyr Val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

260 265 270 

pro Asn Asp Ala Asp lie Ser Glu Pro Lys Arg Ala Cys Pro ser Val 

275 280 285 

Ala Gin Pro Ser Thr Ser Asp Ala Glu Ala Pro Val Asp Tyr Ala Asp 

290 295 300 

Arg Leu Ala Arg Gly Gin Pro Leu Xaa 
305 310 

<210> 9 
<211> 399 
<212> PRT 

<213> Artificial Sequence 

<220> . n 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 9 

Met Glu Leu val Gly Trp Leu Val Asp Arg Gly lie Thr ser Glu Lys 

1 5 10 15 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie ser Phe Asn Ala Ala 

20 25 30 

ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Ser Lys 

35 40 45 

lie Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu val Gly Gin Asn 

50 55 60 

Pro Pro Glu Asp lie Ser Ser Asn Arg lie Tyr Arg lie Leu Glu Met 
65 70 " 75 80 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala ser Val Phe Leu Gly Trp Ala 

85 90 95 

Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 

100 105 110 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala Val Pro 

115 120 125 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

Cys val Asp Lys Met val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys Val val Glu ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg 

165 170 175 

Val Asp Gin Lys Cys Lys Ser Ser Ala Gin lie Asp Pro Thr Pro Val 

180 185 190 

He val Thr ser Asn Thr Asn Met Cys Ala val lie Asp Gly Asn Ser 

195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

2 *0 215 220 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 
225 230 235 240 

Glu Val Lys Asp Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu val 

245 250 255 

Thr His Glu Phe Tyr Val Arg Lys Gly Gly Ala Arg Lys Arq Pro Ala 
260 265 270 
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Pro Asn Asp Ala Asp lie Ser Glu Pro Lys Arg Ala Cys pro ser val 

275 280 285 

Ala Gin Pro ser Thr ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

290 295 300 

Arg Tyr Gin Asn Lys cys ser Arg His val Gly Met Asn Leu Met Leu 
305 310 315 320 

phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin Asn val Asp lie Cys 

325 330 335 

Phe Thr His Gly val Met Asp Cys Ala Glu cys Phe Pro val Ser Glu 

340 345 350 

ser Gin Pro Val ser Val val Arg Lys Arg Thr Tyr Gin Lys Leu Cys 

355 360 365 

pro lie His His lie Met Gly Arg Ala Pro Glu val Ala cys ser Ala 

370 375 380 

cys Glu Leu Ala Asn Val Asp Leu Asp Asp cys Asp Met Glu Gin 
385 390 395 

<210> 10 
<211> 537 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 



10 15 
Phe val ser Trp val Ala Glu 
30 

Asp Met Asp Leu Asn Leu lie 
45 

Lys Leu Gin Arg Glu Phe Leu 
60 

Pro Glu Ala Leu Phe Phe Val 
75 80 
His Leu His lie Leu Val Glu 
90 95 
Gly Arg Tyr Val ser Gin lie 
110 

Arg Gly val Glu Pro Gin Leu 
125 

Arg Asn Gly Ala Gly Gly Gly 
140 

Pro Asn Tyr Leu Leu Pro Lys 
155 160 
Thr Asn Met Asp Gin Tyr lie 
170 175 , 

Lys Arg Leu val Ala Gin His 
190 

Gin Asn Lys Glu Asn Gin Asn 
205 

Ser Lys Thr ser Ala Arg Tyr 
220 

Arg Gly lie Thr ser Glu Lys 
235 240 
Tyr lie ser Phe Asn Ala Ala 
250 255 
Ala Leu Asp Asn Ala ser Lys 
270 

Asp Tyr Leu Val Gly Gin Asn 
285 



<400> 10 










Val Leu 


Met 


Pro 


Gly 


Phe 


Tyr 


Glu 


He 


1 

Glu 


His 


Leu 


Pro 


Gly 


lie 


Ser 


Asp Ser 








20 






25 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


pro 


Asp ser 




35 










40 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


val 


Ala Glu 




50 










55 


Lys Ala 


Val 


Glu 


Trp 


Arg 


Arg 


val 


Ser 


65 






70 




Tyr Phe 


Gin 


Phe 


Glu 


Lys 


Gly 


ASP 


Ser 








85 






val val 


Thr 


val 


Gly 


val 


Lys 


Ser 


Met 






100 






105 


Lys 


Glu 


Lys 


Leu 


val 


Thr 


Arg 


lie Tyr 




115 










120 


Pro 


Asn 


Trp 


Phe 


Ala 


Val 


Thr 


Lys Thr 




130 








135 


Tyr lie 


Asn 


Lys 


val 


val 


Asp 


Asp 


cys 


145 








150 




Ala Trp 


Thr 


Gin 


pro 


GlU 


Leu 


Gin 


Trp 










165 




Glu Arg 


Ser 


Ala 


cys 


Leu 


Asn 


Leu 


Ala 






180 








185 


Leu 


Thr 


His 


val 


ser 


Gin 


Thr 


Gin Glu 




195 








val 


200 


pro 


Asn 


ser 


Asp 


Ala 


pro 


He Arg 




210 








215 


Val Asp 


Met 


Glu 


Leu 


val 


Gly 


Trp 


Leu 


225 








230 




Ala Ser 


Gin 


Trp 


He 


Gin 


Glu 
245 


Asp 


Gin 


ser 


Asn 


Ser 


Arg 


Ser 


Gin 


He 


Lys Ala 






260 








265 


He 


Met 


Ser 


Leu 


Thr 


Lys 


Thr 


Ala Pro 






275 








280 
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290 

Asn Gly Tyr Asp 

Gin Lys Lys Phe 

Thr Thr Gly Lys 
340 

Phe Tyr Gly cys 
355 

cys val Asp Lys 
370 

Lys Val val Glu 
385 

val Asp Gin Lys 

lie val Thr Ser 
420 

Thr Thr Phe Glu 
435 

Glu Leu Thr Lys 
450 

Glu Val Lys Asp 
465 

Thr His Glu Phe 

pro Asn Asp Ala 
500 

Ala Gin Pro Ser 
515 

Arg Leu Ala Arg 
530 



<210> 11 
<211> 623 
<212> PRT 

<213> Artificial Sequence 



lie 


Ser 


Ser 


Asn 


Arg 


lie 


Tyr 


Arg 


He 


Leu 


Glu 


Met 






295 






300 




Gly 




Ala 


Pro 


Gin 


Tyr 


Ala 


Ala 


ser 


val 


Phe 


Leu 


Trp 




310 








315 






Gly 




320 




Lys 


Arg 


Asn 


Thr 


lie 


Trp 


Leu 


Phe 


pro 


Ala 


325 






330 








335 




Thr 


Asn 


lie 


Ala 


Glu 


Ala 


lie 


Ala 


His 


Ala 


val 


Pro 








345 










350 






Val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Phe 


pro 


Phe 


Asn 


ASp 






360 










365 




Thr 


Ala 


Met 


val 


He 


Trp 


Trp 


Glu 


Glu 


Gly 


Lys 


Met 






375 






380 






val 




ser 


Ala 


Lys 


Ala 


lie 


Leu 


six 


Gly 


ser 


Lys 


Arg 




390 








395 




Thr 




400 


cys 


Lys 


Ser 


Ser 


Ala 


Gin 


lie 


Asp 


Pro 


Pro 


Val 


405 








410 










415 




Asn 


Thr 


Asn 


Met 


cys 


Ala 


val 


He 


ASP 


Gly 


Asn 


ser 










425 










430 




Phe 


His 


Gin 


Gin 


Pro 


Leu 


Gin 


Asp 


Arg 


Met 


Phe 


Lys 








440 








445 






Gin 


Arg 


Leu 


Glu 


His 


Asp 


Phe 


Gly 


Lys 


Val 


Thr 


Lys 




455 








460 






G 1 U 


va I 


Phe 


Phe 


Arg 


Trp 


Ala 


Ser 


Asp 


His 


Val 


Thr 




470 






475 










480 


Tyr 


val 


Arg 


Lys 


Gly 


Gly 


Ala 


Arg 


Lys 


Arg 


Pro 


Ala 


485 






490 










495 


Val 


Asp 


He 


Ser 


Glu 


Pro 


Lys 


Arg 


Ala 


cys 


Pro 


Ser 








505 






510 


Ala 




Thr 


Ser 


Asp 


Ala 


Glu 


Ala 


Pro 


Val 


Asp 


Tyr 


Asp 






520 










525 








Gly 


Gin 


Pro 


Leu 


xaa 


















535 





















<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 11 

Met Pro Gly Phe Tyr Glu lie Val Leu Lys val Pro Ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie ser Asp Ser Phe val Ser Trp Val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 

35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Glu Phe Leu 

50 55 60 

Val Glu Trp Arg Arg val ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gin Phe Glu Lys Gly Asp ser Tyr Phe His Leu His lie Leu val Glu 

85 90 95 

Thr val Gly Val Lys ser Met Val Val Gly Arg Tyr val ser Gin He 

100 105 110 

Lys Glu Lys Leu val Thr Arg He Tyr Arg Gly val Glu Pro Gin Leu 

115 120 125 

Pro Asn Trp Phe Ala val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val val Asp Asp Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 
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Thr Gin Pro Glu Leu Gin Trp Ala Trp Tnr Asn Met Asp Gin Tyr He 

165 170 n , 175 

ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 

180 185 190 

Leu Thr His Val ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 

195 200 205 

pro Asn Ser Asp Ala Pro val lie Arg ser Lys Thr ser Ala Arg Tyr 

210 215 220 

Met Glu Leu Val Gly Trp Leu val Asp Arg Gly lie Thr ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie ser Phe Asn Ala Ala 

245 250 255 

ser Asn ser Arg ser Gin lie Lys Ala Ala Leu Asp Asn Ala Ser Lys 

260 265 270 

lie Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Asn 

275 280 285 

pro Pro Glu Asp lie Ser Ser Asn Arg lie Tyr Arg He Leu Glu Met 

290 295 300 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly pro Ala 

325 ~ 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala val Pro 

340 345 350 

Phe Tyr Gly Cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

cys val Asp Lys Met Val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 n 380 

Lys Val Val Glu ser Ala Lys Ala He Leu Gly Gly ser Lys Val Arg 
385 390 395 400 

Val Asp Gin Lys cys Lys Ser ser Ala Gin lie Asp Pro Thr Pro val 

405 410 415 

lie val Thr Ser Asn Thr Asn Met Cys Ala Val lie Asp Gly Asn Ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 

450 ~ 455 460 

Glu val Lys Asp Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 470 475 480 

Thr His Glu Phe Tyr Val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

485 490 495 

Pro Asn Asp Ala Asp He ser Glu Pro Lys Arg Ala Cys Pro ser val 

500 505 510 

Ala Gin Pro ser Thr Ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

515 520 525 

Arg Tyr Gin Asn Lys cys Ser Arg His Val Gly Met Asn Leu Met Leu 

530 535 540 

Phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin Asn Val Asp He Cys 
545 550 555 , 560 

Phe Thr His Gly val Met Asp Cys Ala Glu cys Phe Pro Val Ser Glu 

565 570 575 

Ser Gin Pro Val Ser val val Arg Lys Arg Thr Tyr Gin Lys Leu cys 

580 585 590 

pro He His His He Met Gly Arg Ala Pro Glu Val Ala cys ser Ala 

595 600 605 

cys Glu Leu Ala Asn val Asp Leu Asp Asp cys Asp Met Glu Gin 
610 615 620 

<210> 12 
<211> 939 
<212> DNA 

<213> Artificial sequence 
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<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 12 

atggagctgg 

gaggaccagg 

gccgcgctgg 

gtgggccaga 

aacgggtacg 

gggaagagga 

gaagccatcg 

ccgttcaacg 

aaggtcgtag 

tgcaagtcat 

tgcgcggtca 

atgttcaagt 

gaagtcaaag 

tacgtcagaa 

cccaagcggg 

gactacgcgg 



tcgggtggct 
cgtcctacat 
acaatgcctc 
acccgccgga 
atccgcagta 
acaccatctg 
cccacgccgt 
attgcgtcga 
agagcgccaa 
cggcccagat 
tcgacggaaa 
tcgagctcac 
actttttccg 
agggtggagc 
cctgtccgtc 
acagattggc 



ggtggaccgc 
ctccttcaac 
caaaatcatg 
ggacatttcc 
cgcggcctcc 
gctctttggg 
gcccttctac 
caagatggtg 
ggccatcctg 
cgacccaact 
ctcgaccacc 
caagcgcctg 
gtgggcgtca 
tagaaagagg 
agttgcgcag 
tagaggacaa 



gggatcacgt 
gccgcctcca 
agcctgacaa 
agcaaccgca 
gtcttcctgg 
ccggccacga 
ggctgcgtga 
atctggtggg 
ggcggaagca 
cccgtgatcg 
ttcgagcacc 
gagcacgact 
gatcacgtga 
cccgccccca 
ccatcgacgt 
cctctctga 



cagaaaagca 
actcgcggtc 
agacggctcc 
tctaccgaat 
gctgggcgca 
cgggtaaaac 
actggaccaa 
aggagggcaa 
aggtgcgcgt 
tcacctccaa 
aacaaccact 
ttggcaaggt 
ccgaggtgac 
atgacgcaga 
cagacgcgga 



atggatccag 
acaaatcaag 
ggactacctg 
cctcgagatg 
aaagaagttc 
caacatcgcg 
tgagaacttt 
gatgacggcc 
ggaccaaaag 
caccaacatg 
ccaggaccgg 
caccaagcag 
tcacgagttt 
tataagtgag 
agctccggtg 



<210> 13 
<211> 1197 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 13 

atggagctgg 

gaggaccagg 

gccgcgctgg 

gtgggccaga 

aacgggtacg 

gggaagagga 

gaagccatcg 

ccgttcaacg 

aaggtcgtag 

tgcaagtcat 

tgcgcggtca 

atgttcaagt 

gaagtcaaag 

tacgtcagaa 

cccaagcggg 

gactacgcgg 

tttccctgcc 

gtcatggact 

aagcggacgt 

gcctgctcgg 



tcgggtggct 
cgtcctacat 
acaatgcctc 
acccgccgga 
atccgcagta 
acaccatctg 
cccacgccgt 
attgcgtcga 
agagcgccaa 
cggcccagat 
tcgacggaaa 
tcgagctcac 
actttttccg 
agggtggagc 
cctgtccgtc 
acaggtacca 
ggcaatgcga 
gtgccgagtg 
atcagaaact 
cctgcgaact 



ggtggaccgc 
ctccttcaac 
caaaatcatg 
ggacatttcc 
cgcggcctcc 
gctctttggg 
gcccttctac 
caagatggtg 
ggccatcctg 
cgacccaact 
ctcgaccacc 
caagcgcctg 
gtgggcgtca 
tagaaagagg 
agttgcgcag 
aaacaaatgt 
gagaatgaat 
cttccccgtg 
gtgtccgatt 
ggccaatgtg 



gggatcacgt 
gccgcctcca 
agcctgacaa 
agcaaccgca 
gtcttcctgg 
ccggccacga 
ggctgcgtga 
atctggtggg 
ggcggaagca 
cccgtgatcg 
ttcgagcacc 
gagcacgact 
gatcacgtga 
cccgccccca 
ccatcgacgt 
tctcgtcacg 
cagaatgtgg 
tcagaatctc 
catcacatca 
gacttggatg 



cagaaaagca 
actcgcggtc 
agacggctcc 
tctaccgaat 
gctgggcgca 
cgggtaaaac 
actggaccaa 
aggagggcaa 
aggtgcgcgt 
tcacctccaa 
aacaaccact 
ttggcaaggt 
ccgaggtgac 
atgacgcaga 
cagacgcgga 
tgggtatgaa 
acatttgctt 
aacccgtgtc 
tggggagggc 
actgtgacat 



atggatccag 
acaaatcaag 
ggactacctg 
cctcgagatg 
aaagaagttc 
caacatcgcg 
tgagaacttt 
gatgacggcc 
ggaccaaaag 
caccaacatg 
ccaggaccgg 
caccaagcag 
tcacgagttt 
tataagtgag 
agctccggtg 
tctgatgctt 
cacgcacggg 
tgtcgtcaga 
gcccgaggtg 
ggaacaa 



<210> 14 
<211> 1611 
<212> DNA 

<213> Artificial sequence 

< 22 0> ..... 

<223> Description of Artificial sequence; note = 

synthetic construct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
939 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1197 
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<400> 14 

atgccggggt 

ggcatttctg 

tctgacatgg 

cgcgagttcc 

cagttcgaga 

aaatccatgg 

taccgcgggg 

gccggaggcg 

acccagcccg 

aatctcgcgg 

gagcagaaca 

tccgccaggt 

caatggatcc 

tcacaaatca 

ccggactacc 

atcctcgaga 

caaaagaagt 

accaacatcg 

aatgagaact 

aagatgacgg 

gtggaccaaa 

aacaccaaca 

ctccaggacc 

gtcaccaagc 

actcacgagt 

gatataagtg 

gaagctccgg 



tctacgagat 
actcttttgt 
acttgaatct 
tggtcgagtg 
agggggacag 
tggtgggccg 
tcgagccgca 
ggaacaaggt 
agctccagtg 
agcgtaaacg 
aggaaaacca 
acatggagct 
aggaggacca 
aggccgcgct 
tggtgggcca 
tgaacgggta 
tcgggaagag 
cggaagccat 
ttccgttcaa 
ccaaggtcgt 
agtgcaagtc 
tgtgcgcggt 
ggatgttcaa 
aggaagtcaa 
tttacgtcag 
agcccaagcg 
tggactacgc 



cgtgctgaag 
gagctgggtg 
gattgagcag 
gcgccgcgtg 
ctacttccac 
ctacgtgagc 
gcttccgaac 
ggtggacgac 
ggcgtggact 
gctggtggcg 
gaaccccaat 
ggtcgggtgg 
ggcgtcctac 
ggacaatgcc 
gaacccgccg 
cgatccgcag 
gaacaccatc 
cgcccacgcc 
cgattgcgtc 
agagagcgcc 
atcggcccag 
catcgacgga 
gttcgagctc 
agactttttc 
aaagggtgga 
ggcctgtccg 
ggacagattg 



gtgcccagcg 
gccgagaagg 
gcacccctga 
agtaaggccc 
ctgcacatcc 
cagattaaag 
tggttcgcgg 
tgctacatcc 
aacatggacc 
cagcatctga 
tctgacgcgc 
ctggtggacc 
atctccttca 
tccaaaatca 
gaggacattt 
tacgcggcct 
tggctctttg 
gtgcccttct 
gacaagatgg 
aaggccatcc 
atcgacccaa 
aactcgacca 
accaagcgcc 
cggtgggcgt 
gctagaaaga 
tcagttgcgc 
gctagaggac 



acctggacga 
aatgggagct 
ccgtggccga 
cggaggccct 
tggtggagac 
agaagctggt 
tgaccaagac 
ccaactacct 
agtatataag 
cgcacgtgtc 
cggtcatcag 
gcgggatcac 
acgccgcctc 
tgagcctgac 
ccagcaaccg 
ccgtcttcct 
ggccggccac 
acggctgcgt 
tgatctggtg 
tgggcggaag 
ctcccgtgat 
ccttcgagca 
tggagcacga 
cagatcacgt 
ggcccgcccc 
agccatcgac 
aacctctctg 



gcacctgccc 
gccgccggat 
aaagctgcaa 
cttctttgtc 
cgtgggcgtc 
gacccgcatc 
gcgtaatggc 
gctccccaag 
cgcctgtttg 
gcagacgcag 
gtcaaaaacc 
gtcagaaaag 
caactcgcgg 
aaagacggct 
catctaccga 
gggctgggcg 
gacgggtaaa 
gaactggacc 
ggaggagggc 
caaggtgcgc 
cgtcacctcc 
ccaacaacca 
ctttggcaag 
gaccgaggtg 
caatgacgca 
gtcagacgcg 
a 



<210> 15 
<211> 1872 
<212> DNA 
<213> Artificial 



sequence 



<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 



<400> 15 

atgccggggt 

ggcatttctg 

tctgacatgg 

cgcgagttcc 

cagttcgaga 

aaatccatgg 

taccgcgggg 

gccggaggcg 

acccagcccg 

aatctcgcgg 

gagcagaaca 

tccgccaggt 

caatggatcc 

tcacaaatca 

ccggactacc 

atcctcgaga 

caaaagaagt 

accaacatcg 

aatgagaact 

aagatgacgg 

gtggaccaaa 

aacaccaaca 

ctccaggacc 

gtcaccaagc 



tctacgagat 
actcttttgt 
acttgaatct 
tggtcgagtg 
agggggacag 
tggtgggccg 
tcgagccgca 
ggaacaaggt 
agctccagtg 
agcgtaaacg 
aggaaaacca 
acatggagct 
aggaggacca 
aggccgcgct 
tggtgggcca 
tgaacgggta 
tcgggaagag 
cggaagccat 
ttccgttcaa 
ccaaggtcgt 
agtgcaagtc 
tgtgcgcggt 
ggatgttcaa 
aggaagtcaa 



cgtgctgaag 
gagctgggtg 
gattgagcag 
gcgccgcgtg 
ctacttccac 
ctacgtgagc 
gcttccgaac 
ggtggacgac 
ggcgtggact 
gctggtggcg 
gaaccccaat 
ggtcgggtgg 
ggcgtcctac 
ggacaatgcc 
gaacccgccg 
cgatccgcag 
gaacaccatc 
cgcccacgcc 
cgattgcgtc 
agagagcgcc 
atcggcccag 
catcgacgga 
gttcgagctc 
agactttttc 



gtgcccagcg 
gccgagaagg 
gcacccctga 
agtaaggccc 
ctgcacatcc 
cagattaaag 
tggttcgcgg 
tgctacatcc 
aacatggacc 
cagcatctga 
tctgacgcgc 
ctggtggacc 
atctccttca 
tccaaaatca 
gaggacattt 
tacgcggcct 
tggctctttg 
gtgcccttct 
gacaagatgg 
aaggccatcc 
atcgacccaa 
aactcgacca 
accaagcgcc 
cggtgggcgt 



acctggacga 
aatgggagct 
ccgtggccga 
cggaggccct 
tggtggagac 
agaagctggt 
tgaccaagac 
ccaactacct 
agtatataag 
cgcacgtgtc 
cggtcatcag 
gcgggatcac 
acgccgcctc 
tgagcctgac 
ccagcaaccg 
ccgtcttcct 
ggccggccac 
acggctgcgt 
tgatctggtg 
tgggcggaag 
ctcccgtgat 
ccttcgagca 
tggagcacga 
cagatcacgt 



gcacctgccc 
gccgccggat 
aaagctgcaa 
cttctttgtc 
cgtgggcgtc 
gacccgcatc 
gcgtaatggc 
gctccccaag 
cgcctgtttg 
gcagacgcag 
gtcaaaaacc 
gtcagaaaag 
caactcgcgg 
aaagacggct 
catctaccga 
gggctgggcg 
gacgggtaaa 
gaactggacc 
ggaggagggc 
caaggtgcgc 
cgtcacctcc 
ccaacaacca 
ctttggcaag 
gaccgaggtg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1611 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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actcacgagt tttacgtcag aaagggtgga gciagaaaga ggcccgcccc caatgacgca 1500 

gatataagtg agcccaagcg ggcctgtccg tcagttgcgc agccatcgac gtcagacgcg 1560 

gaagctccgg tggactacgc ggacaggtac caaaacaaat gttctcgtca cgtgggtatg 1620 

aatctgatgc tttttccctg ccggcaatgc gagagaatga atcagaatgt ggacatttgc 1680 

ttcacgcacg gggtcatgga ctgtgccgag tgcttccccg tgtcagaatc tcaacccgtg 1740 

tctgtcgtca gaaagcggac gtatcagaaa ctgtgtccga ttcatcacat catggggagg 1800 

gcgcccgagg tggcctgctc ggcctgcgaa ctggccaatg tggacttgga tgactgtgac 1860 

atggaacaat aa 1872 

<210> 16 
<211> 598 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 16 

Thr Ala Pro Gly i_ys Lys Arg Pro Leu lie Glu Ser Pro Gin Gin Pro 

1 5 10 15 

Asp ser ser Thr Gly lie Gly Lys Lys Gly Lys Gin Pro Ala Lys Lys 

20 25 30 

Lys Leu val Phe Glu Asp Glu Thr Gly Ala Gly Asp Gly Pro Pro Glu 

35 40 ■ 45 

Gly Ser Thr Ser Gly Ala Met ser Asp Asp ser Glu Met Arg Ala Ala 

50 55 60 

Ala Gly Gly Ala Ala val Glu Gly Gly Gin Gly Ala Asp Gly val Gly 
65 70 75 n i 80 

Asn Ala ser Gly Asp Trp His Cys Asp Ser Thr Trp Ser Glu Gly His 

85 90 95 

val Thr Thr Thr ser Thr Arg Thr Trp Val Leu Pro Thr Tyr Asn Asn 

100 105 110 

His Leu Tyr Lys Arg Leu Gly Glu Ser Leu Gin Ser Asn Thr Tyr Asn 

115 120 125 

Gly Phe ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys 

y 130 135 140 

His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly 
145 150 155 n 160 

Met Arg pro Lys Ala Met Arg Val Lys lie Phe Asn lie Gin val Lys 

165 ~ 170 175 

Glu val Thr Thr Ser Asn Gly Glu Thr Thr val Ala Asn Asn Leu Thr 

180 185 190 

ser Thr val Gin lie Phe Ala Asp ser ser Tyr Glu Leu Pro Tyr val 

195 200 205 

Met Asp Ala Gly Gin Glu Gly ser Leu Pro Pro Phe Pro Asn Asp Val 

210 215 220 

Phe Met val Pro Gin Tyr Gly Tyr cys Gly Leu val Thr Gly Asn Thr 
225 230 235 n 240 

Ser Gin Gin Gin Thr Asp Arg Asn Ala Phe Tyr Cys Leu Glu Tyr Phe 

245 250 255 

pro ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu lie Thr Tyr Ser 

260 265 270 

Phe Glu Lys val Pro Phe His ser Met Tyr Ala His Ser Gin Ser Leu 

275 280 285 

Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Trp Gly Leu Gin 

290 295 300 

ser Thr Thr Thr Gly Thr Thr Leu Asn Ala Gly Thr Ala Thr Thr Asn 
305 310 315 320 

Phe Thr Lys Leu Arg Pro Thr Asn Phe Ser Asn Phe Lys Lys Asn Trp 

325 330 335 

Leu pro Gly Pro ser lie Lys Gin Gin Gly Phe Ser Lys Thr Ala Asn 
340 345 350 
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Gly 


Ser 


Asp 


ser 


Leu lie Lys 
365 


Tyr 


Arg 


Trp 


ser 


Ala 


Leu Thr Pro 


Gly 




380 






Ala 


Asp 


Ser 


Lys 


phe Ser Asn 


Ser 




395 




400 


Gin 


Asn 


Gly 


Asn 


Thr Ala Thr 


Val 




410 




415 




Glu 


Glu 


Glu 


Leu 


Ala Ala Thr 


Asn 


425 








430 




Asn 


Leu 


Pro 


Gly 


Gly Asp Gin 


Ser 








445 


val 


Arg 


Leu 


Thr 


Ala 


Leu Gly Ala 






460 


He 


Asp 


He 


Tyr 


Tyr 


Gin Gly Pro 




475 




480 


Gly 


His 


Phe 


His 


pro Ser Pro 


Leu 


490 






495 




pro 


Pro 


pro 


Gin 


He Phe lie 


Lys 


505 








510 




Ala 


Thr 


Thr 


Phe 


Ser Ser Thr 


pro 










525 




Ser 


Thr 


Gly 


Gin 


val Ser val 


Gin 






540 




Glu 


Arg 


ser 


Lys 


Arg 


Trp Asn Pro 




555 




560 


Gin 


Gin 


Asn 


Ser 


Leu Leu Trp 


Ala 




570 






575 




Glu 


Pro 


Arg 


Ala 


He Gly Thr 


Arg 


585 






590 





355 3 360 
Glu Thr His ser Thr Leu Asp Gly 

370 375 
pro Pro Met Ala Thr Ala Gly Pro 
385 „ 390 

Gin Leu lie Phe Ala Gly Pro Lys 
405 

pro Gly Thr Leu lie Phe Thr ser 
420 

Ala Thr Asp Thr Asp Met Trp Gly 
435 440 
Asn ser Asn Leu Pro Thr val Asp 

450 455 
Pro Gly Met val Trp Gin Asn Arg 
465 470 
Trp Ala Lys lie Pro His Thr Asp 
485 

He Gly Gly Phe Gly Leu Lys His 
500 

Asn Thr Pro Val Pro Ala Asn Pro 
515 520 
val Asn ser Phe lie Thr Gin Tyr 

530 535 
lie Asp Trp Glu lie Gin Lys Glu 
545 550 
val Gin Phe Thr Ser Asn Tyr Gly 
565 

pro Asp Ala Ala Gly Lys Tyr Thr 
580 

Tyr Leu Thr His His Leu 
595 

<210> 17 
<211> 1800 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<221> misc_feature 
<222> (0)...(0) 
<223> n=a,t,c, or g 

<221> variation 
<222> CO)... CO) 
<223> xaa = any amino acid 

<400> 17 co 

acggctcctg gaaagaagag accgttgatt gaatcccccc agcagcccga ctcctccacg 60 

ggtatcggca aaaaaggcaa gcagccggct aaaaagaagc tcgttttcga agacgaaact 120 

ggagcaggcg acggaccccc tgagggatca acttccggag ccatgtctga tgacagtgag 180 

atgcgtgcag cagctggcgg agctgcagtc gagggsggac aaggtgccga tggagtgggt 240 

aatgcctcgg gtgattggca ttgcgattcc acctggtctg agggccacgt cacgaccacc 300 

agcaccagaa cctgggtctt gcccacctac aacaaccacc tntacaagcg actcggagag 360 

agcctgcagt ccaacaccta caacggattc tccaccccct ggggatactt tgacttcaac 420 

cgcttccact gccacttctc accacgtgac tggcagcgac tcatcaacaa caactggggc 480 

atgcgaccca aagccatgcg ggtcaaaatc ttcaacatcc aggtcaagga ggtcacgacg 540 

tcgaacggcg agacaacggt ggctaataac cttaccagca cggttcagat ctttgcggac 600 

tcgtcgtacg aactgccgta cgtgatggat gcgggtcaag agggcagcct gcctcctttt 6bu 

cccaacgacg tctttatggt gccccagtac ggctactgtg qactggtgac cggcaacact 72U 
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tcgcagcaac agactgacag aaatgccttc tactgcctgg agtactttcc ttcgcagatg 780 

ctgcggactg gcaacaactt tgaaattacg tacagttttg agaaggtgcc tttccactcg 840 

atgtacgcgc acagccagag cctggaccgg ctgatgaacc ctctcatcga ccagtacctg 900 

tggggactgc aatcgaccac caccggaacc accctgaatg ccgggactgc caccaccaac 960 

tttaccaagc tgcggcctac caacttttcc aactttaaaa agaactggct gcccgggcct 1020 

tcaatcaagc agcagggctt ctcaaagact gccaatcaaa actacaagat ccctgccacc 1080 

qqgtcagaca gtctcatcaa atacgagacg cacagcactc tggacggaag atggagtgcc 1140 

ctgacccccg gacctccaat ggccacggct ggacctgcgg acagcaagtt cagcaacagc 1200 

cagctcatct ttgcggggcc taaacagaac ggcaacacgg ccaccgtacc cgggactctg 1260 

atcttcacct ctgaggagga gctggcagcc accaacgcca ccgatacgga catgtggggc 1320 

aacctacctg gcggtgacca gagcaacagc aacctgccga ccgtggacag actgacagcc 1380 

ttqqgagccg tgcctggaat ggtctggcaa aacagagaca tttactacca gggtcccatt 1440 

tgggccaaga ttcctcatac cgatggacac tttcacccct caccgctgat tggtgggttt 1500 

qggctgaaac acccgcctcc tcaaattttt atcaagaaca ccccggtacc tgcgaatcct 1560 

gcaacgacct tcagctctac tccggtaaac tccttcatta ctcagtacag cactggccag 1620 

gtgtcggtgc agattgactg ggagatccag aaggagcggt ccaaacgctg gaaccccgag 1680 

gtccagttta cctccaacta cggacagcaa aactctctgt tgtgggctcc cgatgcggct 1740 

gggaaataca ctgagcctag ggctatcggt acccgctacc tcacccacca cctgtaataa 1800 

<210> 18 
<211> 544 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

Met°Ser 8 Asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala val 

1 5 10 , 15 

Glu Gly Gly Gin Gly Ala Asp Gly Val Gly Asn Ala ser Gly Asp Trp 

20 25 30 

His cys Asp Ser Thr Trp Ser Glu Gly His Val Thr Thr Thr Ser Thr 

35 40 45 

Arq Thr Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu 

50 55 60 

Gly Glu Ser Leu Gin Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp 
65 70 75 80 

Gly Tyr Phe Asp Phe Asn Arg Phe His cys His Phe ser Pro Arg Asp 

85 90 95 

Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met 

K 100 105 HO 

Arq Val Lys lie Phe Asn lie Gin val Lys Glu val Thr Thr Ser Asn 

115 120 125 

Gly Glu Thr Thr val Ala Asn Asn Leu Thr Ser Thr Val Gin lie phe 

130 135 140 

Ala Asp ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu 
145 150 155 160 

Gly ser Leu pro Pro Phe Pro Asn Asp Val Phe Met val Pro Gin Tyr 

165 170 175 

Glv Tyr Cys Gly Leu Val Thr Gly Asn Thr ser Gin Gin Gin Thr Asp 

y y 180 185 190 

Arg Asn Ala Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 

195 200 205 

Thr Gly Asn Asn Phe Glu lie Thr Tyr ser Phe Glu Lys val Pro Phe 

210 215 220 

His ser Met Tyr Ala His ser Gin ser Leu Asp Arg Leu Met Asn Pro 
225 230 235 240 

Leu lie Asp Gin Tyr Leu Trp Gly Leu Gin ser Thr Thr Thr Gly Thr 

245 250 255 

Thr Leu Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro 
260 265 270 
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1(11 Hill NIC JCI r^jn rut i-jr j i_jr j n^n ■ ■ K ' ' w r'i 

275 280 285 

Lvs Gin Gin Gly Phe Ser Lys Thr Ala Asn Gin Asn Tyr Lys lie Pro 

290 295 300 

Ala Thr Gly ser Asp Ser Leu He Lys Tyr Glu Thr His Ser Thr Leu 
305 310 315 320 

asp Gly Arg Trp ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala 

325 330 n 335 

Gly Pro Ala Asp Ser Lys Phe Ser Asn ser Gin Leu lie Phe Ala Gly 

340 345 350 

pro Lys Gin Asn Gly Asn Thr Ala Thr Val Pro Gly Thr Leu He Phe 

355 360 365 

Thr Ser Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met 

370 375 380 

Trp Gly Asn Leu Pro Gly Gly Asp Gin ser Asn Ser Asn Leu Pro Thr 
385 390 395 400 

Val Asp Arg Leu Thr Ala Leu Gly Ala val Pro Gly Met val Trp Gin 

405 410 415 m 

Asn Arg Asp He Tyr Tyr Gin Gly Pro lie Trp Ala Lys lie Pro His 

420 425 430 

Thr Asp Gly His Phe His Pro Ser Pro Leu lie Gly Gly Phe Gly Leu 

435 440 u 445 

Lvs His Pro Pro Pro Gin lie Phe lie Lys Asn Thr Pro val Pro Ala 

450 455 460 

Asn Pro Ala Thr Thr Phe Ser ser Thr Pro val Asn Ser Phe He Thr 
465 470 475 n , 480 

Gin Tyr Ser Thr Gly Gin Val Ser val Gin lie Asp Trp Glu lie Gin 

485 490 495 

Lys Glu Arg Ser Lys Arg Trp Asn Pro Glu val Gin Phe Thr ser Asn 

500 505 510 

Tyr Gly Gin Gin Asn Ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys 

515 520 525 m 

Tyr Thr Glu Pro Arg Ala lie Gly Thr Arg Tyr Leu Thr His His Leu 
530 535 540 

<210> 19 
<211> 1617 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<221> misc_feature 
<222> (0)...(0) 
<223> n=a,t,c t or g 

<221> variation 

<222> (0)...(0) , 

<223> Xaa = any amino acid 



<400> 19 . cf\ 

atgcgtgcag cagctggcgg agctgcagtc gagggsggac aaggtgccga tggagtgggt 60 

aatgcctcgg gtgattggca ttgcgattcc acctggtctg agggccacgt cacgaccacc 120 

agcaccagaa cctgggtctt gcccacctac aacaaccacc tntacaagcg actcggagag 180 

agcctgcagt ccaacaccta caacggattc tccaccccct ggggatactt tgacttcaac 240 

cgcttccact gccacttctc accacgtgac tggcagcgac tcatcaacaa caactggggc 300 

atgcgaccca aagccatgcg ggtcaaaatc ttcaacatcc aggtcaagga ggtcacgacg 360 

tcgaacggcg agacaacggt ggctaataac cttaccagca cggttcagat ctttgcggac 420 

tcgtcgtacg aactgccgta cgtgatggat gcgggtcaag agggcagcct gcctcctttt 480 

cccaacgacg tctttatggt gccccagtac ggctactgtg gactggtgac cggcaacact 540 

tcgcagcaac agactgacag aaatgccttc tactgcctgg agtactttcc ttcgcagatg 600 
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ctgcggactg 
atgtacgcgc 
tggggactgc 
tttaccaagc 
tcaatcaagc 
gggtcagaca 
ctgacccccg 
cagctcatct 
atcttcacct 
aacctacctg 
ttgggagccg 
tgggccaaga 
gggctgaaac 
gcaacgacct 
gtgtcggtgc 
gtccagttta 
gggaaataca 



gcaacaactt 
acagccagag 
aatcgaccac 
tgcggcctac 
agcagggctt 
gtctcatcaa 
gacctccaat 
ttgcggggcc 
ctgaggagga 
gcggtgacca 
tgcctggaat 
ttcctcatac 
acccgcctcc 
tcagctctac 
agattgactg 
cctccaacta 
ctgagcctag 



tgaaattacg 
cctggaccgg 
caccggaacc 
caacttttcc 
ctcaaagact 
atacgagacg 
ggccacggct 
taaacagaac 
gctggcagcc 
gagcaacagc 
ggtctggcaa 
cgatggacac 
tcaaattttt 
tccggtaaac 
ggagatccag 
cggacagcaa 
ggctatcggt 



tacagrcrcg 
ctgatgaacc 
accctgaatg 
aactttaaaa 
gccaatcaaa 
cacagcactc 
ggacctgcgg 
ggcaacacgg 
accaacgcca 
aacctgccga 
aacagagaca 
tttcacccct 
atcaagaaca 
tccttcatta 
aaggagcggt 
aactctctgt 
acccgctacc 



agaaggxgcc 
ctctcatcga 
ccgggactgc 
agaactggct 
actacaagat 
tggacggaag 
acagcaagtt 
ccaccgtacc 
ccgatacgga 
ccgtggacag 
tttaetacca 
caccgctgat 
ccccggtacc 
ctcagtacag 
ccaaacgctg 
tgtgggctcc 
tcacccacca 



tttccactcg 
ccagtacctg 
caccaccaac 
gcccgggcct 
ccctgccacc 
atggagtgcc 
cagcaacagc 
cgggactctg 
catgtggggc 
actgacagcc 
gggtcccatt 
tggtgggttt 
tgcgaatcct 
cactggccag 
gaaccccgag 
cgatgcggct 
cctgtaa 



<210> 20 
<211> 129 
<212> DNA 

<213> Artificial Sequence 

<220> . „ 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 20 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctgcggcca gaggccggca 
gtctggagac ctttggtgtc cagggcaggg ccgagtgagt gagcgagcgc gcatagaggg 
agtggccaa 

<210> 21 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 21 

tctagtctag acttggccac tccctctctg cgcgc 

<210> 22 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 22 

aggccttaag agcagtcgtc caccaccttg ttcc 

<210> 23 
<211> 4652 
<212> DNA 

<213> Artificial Sequence 
<220> 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1617 



60 
120 
129 



35 



34 



<223> Description of Artificial Sequence; note = 
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synthetic construct 



<400> 23 

tggcactctc 

caaagagctg 

gcgaacgcga 

tgtcataatg 

tgttttatcc 

aagaccgagt 

gccatggcta 

cctggaattt 

gagtcagatt 

cgccgcgtgt 

cagtttgaaa 

tcttccatgg 

ttccagggaa 

ggagccaata 

ccggagcttc 

gaggagcgca 

gcggcttcgc 

tacatggcgc 

caggaaaatc 

aaggccgcgc 

ctcgtgggga 

atgaatggct 

ttcaacaaga 

gcggaggcca 

tttcccttta 

aacaaggtgg 

aaatgtaaat 

atgtgtgtgg 

cgcatgttca 

caggaagtca 

tttaaagttc 

ctgggtgacg 

gttcccgaga 

aattggaatt 

aacaaatgtg 

gtaactcact 

tttggggatt 

gttgatcacc 

cttgaagcgg 

cttgtgctgc 

gtcaacaggg 

gcgggagaca 

gccgacgaca 

gttctcgaac 

atagacgacc 

acctcgtcag 

ccagcctcaa 

aataaccaag 

tggatggggg 

aaccaccagt 

tttggataca 

ccccgagact 

gtcaaaatct 

gccaacaacc 

gtcgtcggca 

ccgcagtacg 

agcttcttct 

tttacctaca 

ttcaagctgg 

actggcggag 



ccccctgtcg 

ccagacgacg 

caggggggag 

atgtaatgct 

aataggaaga 

gaacgagccc 

cc'ttctatga 

ctgacagctt 

taaatttgac 

tcctgtacga 

agggatctga 

tcctcggccg 

ttgaacccca 

aggtggtgga 

agtgggcgtg 

aacggctcgt 

agcgtgagtt 

tcgtcaactg 

aggagagcta 

tcgacaacgc 

gctccgttcc 

acgacccggc 

ggaacaccgt 

tcgcccacac 

atgactgtgt 

ttgaatccgc 

cctctgttca 

tggtggatgg 

aatttgaact 

aggacttttt 

ccagggaatt 

tcaccaatac 

cgcctcgcag 

caaggtatga 

atgaatgtga 

gtcaaatttg 

ttgacgatgc 

ctccagattg 

gcccaccgaa 

ctggttataa 

cagacgaggt 

acccctacct 

catccttcgg 

cttttggcct 

actttccaaa 

acgccgaagc 

gtttgggagc 

gtgccgatgg 

acagagtcgt 

accgagagat 

gcaccccctg 

ggcaaagact 

tcaacattca 

tcacctccac 

acgggaccga 

gttacgcgac 

gcctagagta 

actttgagga 

ccaacccgct 

tccagttcaa 



cgttcgctcg 

gccctctggc 

agtgccacac 

tattgtcacg 

aagcgcgcgt 

gccgccattc 

agtcattgtt 

tgtggactgg 

tctggttgaa 

gtggaacaaa 

atattttcat 

ctacgtgagt 

gatcaacgac 

ttctgggtat 

gacaaacctg 

cgcgcagttt 

ctcggctgac 

gctcgtggag 

cctctccttc 

gaccaaaatt 

cgaggacatt 

ctacgcggga 

ctggctctac 

tgtgcccttt 

ggacaaaatg 

caaggccatc 

aattgattct 

gaattccacg 

gactaagcgg 

tgcttgggca 

ggcgggaact 

tagctataaa 

ttcagacgtg 

ttgcaaatgt 

atatttgaat 

tcatgggatt 

caataaagaa 

gttggaagaa 

accaaaaccc 

ctatctcgga 

cgcgcgagag 

caagtacaac 

gggaaacctc 

ggttgaagag 

aagaaagaag 

tggacccagc 

tgatacaatg 

agtgggcaat 

caccaagtcc 

caaaagcggc 

ggggtacttt 

catcaacaac 

agtcaaagag 

cgtccaagtg 

gggatgcctg 

gctgaaccgc 

ctttcccagc 

ggtgcccttc 

ggtggaccag 

caagaacctg 



ctcgctggct 

cgtcgccccc 

tctcaagcaa 

cgatagttaa 

atgagttctc 

tttgctctgg 

cgcgtcccat 

gtaactggtc 

cagcctcagt 

ttttccaagc 

ctgcacacgc 

cagattcgcg 

tgggtcgcca 

attcccgcct 

gacgagtata 

ctggcagaat 

ccggtcatca 

cacggcatca 

aactccaccg 

atgagtctga 

tcaaaaaaca 

tccatcctct 

ggacccgcca 

tacggctgcg 

ctcatttggt 

ctggggggct 

acccctgtca 

acctttgaac 

ctcccgccag 

aaggtcaatc 

aaaggggcgg 

agtctggaga 

actgttgatc 

gactatcatg 

cggggcaaaa 

cccccctggg 

cagtaaataa 

gttggtgaag 

aatcagcagc 

cccggaaacg 

cacgacatct 

cacgcggacg 

ggaaaggcag 

ggtgctaaga 

gctcggaccg 

ggatcccagc 

tctgcgggag 

gcctcgggag 

acccgaacct 

tccgtcgacg 

gactttaacc 

tactggggct 

gtcacggtgc 

tttacggacg 

ccggccttcc 

gacaacacag 

aagatgctga 

cactccagct 

tacttgtacc 

gccgggagat 



cgtttggggg 
ccaaacgagc 
gggggttttg 
tgattaacag 
gcgagacttc 
actgctagag 
ttgacgtgga 
aaatttggga 
tgacggtggc 
aggagtccaa 
ttgtggagac 
cccagctggt 
tcaccaaggt 
acctgctgcc 
aattggccgc 
cctcgcagcg 
aaagcaagac 
cttccgagaa 
gcaactctcg 
caaaaagcgc 
gaatctggca 
acggctggtg 
cgaccggcaa 
tgaactggac 
gggaggaggg 
caaaggtgcg 
ttgtaacttc 
accagcagcc 
attttggcaa 
aggtgccggt 
agaaatctct 
agcgggccag 
ccgctcctct 
ctcaatttga 
atggatgtat 
aaaaggaaaa 
agcgagtagt 
gtcttcgcga 
atcaagatca 
gtctcgatcg 
cgtacaacga 
ccgagtttca 
tctttcaggc 
cggcccctac 
aagaggactc 
agctgcaaat 
gtggcggccc 
attggcattg 
gggtgctgcc 

gaagcaacgc 
gcttccacag 
tcagaccccg 
aggactccac 
acgactacca 
ctccgcaggt 
aaaatcccac 
gaacgggcaa 
tcgctcccag 
gcttcgtgag 
acgccaacac 



ggtggcagct 

cagcgagcga 

taagcagtga 

tcatgtgatg 

cggggtataa 

gaccctcgct 

ggaacatctg 

gctgcctcca 

tgatagaatt 

attctttgtg 

ctccggcatc 

gaaagtggtc 

aaagaagggc 

gaaggtccaa 

cctgaatctg 

ctcgcaggag 

ttcccagaaa 

gcagtggatc 

gagccagatc 

ggtggactac 

aatttttgag 

tcagcgctcc 

gaccaacatc 

caatgaaaac 

aaagatgacc 

ggtcgatcag 

caatacaaac 

gctggaggac 

gattactaag 

gactcacgag 

aaaacgccca 

gctctcattt 

gcgaccgctc 

caacatttct 

ctgtcacaat 

cttgtcagat 

catgtctttt 

gtttttgggc 

agcccgtggt 

aggagagcct 

gcagcttgag 

ggagaagctc 

caagaaaagg 

cggaaagcgg 

caagccttcc 

cccagcccaa 

attgggcgac 

cgattccacg 

cagctacaac 

caacgcctac 

ccactggagc 

gtccctcaga 

caccaccatc 

gctgccctac 

ctttacgctg 

cgagaggagc 

caactttgag 

tcagaacctg 

cacaaataac 

ctacaaaaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 



24/60 



WO 2006/029196 



PCT/US2005/031837 



tggttcccgg ggcccatggg ccgaacccag ggctggaacc tgggctccgg ggtcaaccgc 3660 

gccagtgtca gcgccttcgc cacgaccaat aggatggagc tcgagggcgc gagttaccag 3720 

qtqcccccgc agccgaacgg catgaccaac aacctccagg gcagcaacac ctatgccctg 3780 

gagaacacta tgatcttcaa cagccagccg gcgaacccgg gcaccaccgc cacgtacctc 3840 

gagggcaaca tgctcatcac cagcgagagc gagacgcagc cggtgaaccg cgtggcgtac 3900 

aacgtcggcg ggcagatggc caccaacaac cagagctcca ccactgcccc cgcgaccggc 3960 

acgtacaacc tccaggaaat cgtgcccggc agcgtgtgga tggagaggga cgtgtacctc 4020 

caaggaccca tctgggccaa gatcccagag acgggggcgc actttcaccc ctctccggcc 4080 

atgggcggat tcggactcaa acacccaccg cccatgatgc tcatcaagaa cacgcctgtg 4140 

cccggaaata tcaccagctt ctcggacgtg cccgtcagca gcttcatcac ccagtacagc 4200 

accgggcagg tcaccgtgga gatggagtgg gagctcaaga aggaaaactc caagaggtgg 4260 

aacccagaga tccagtacac aaacaactac aacgaccccc agtttgtgga ctttgccccg 4320 

qacagcaccg gggaatacag aaccaccaga cctatcggaa cccgatacct tacccgaccc 4380 

ctttaaccca ttcatgtcgc ataccctcaa taaaccgtgt attcgtgtca gtaaaatact 4440 

gcctcttgtg gtcattcaat gaataacagc ttacaacatc tacaaaacct ccttgcttga 4500 

gagtgtggca ctctcccccc tgtcgcgttc gctcgctcgc tggctcgttt gggggggtgg 4560 

cagctcaaag agctgccaga cgacggccct ctggccgtcg cccccccaaa cgagccagcg 4620 

agcgagcgaa cgcgacaggg gggagagtgc ca 4652 

<210> 24 
<211> 390 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

Met Ala Leu val Asn Trp Leu val Glu His Gly lie Thr Ser Glu Lys 

1 5 10 15 

Gin Trp lie Gin Glu Asn Gin Glu Ser Tyr Leu ser Phe Asn Ser Thr 

20 25 30 

Gly Asn ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Thr Lys 

35 40 45 

lie Met Ser Leu Thr Lys Ser Ala val Asp Tyr Leu val Gly Ser Ser 

50 55 60 

Val Pro Glu Asp lie Ser Lys Asn Arg He Trp Gin lie Phe Glu Met 
65 70 75 80 

Asn Gly Tyr Asp Pro Ala Tyr Ala Gly ser lie Leu Tyr Gly Trp cys 

85 90 .95 

Gin Arg Ser Phe Asn Lys Arg Asn Thr Val Trp Leu Tyr Gly Pro Ala 

100 " 105 110 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Thr Val Pro 

115 120 125 

Phe Tyr Gly cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

cys val Asp Lys Met Leu lie Trp Trp Glu Glu Gly Lys Met Thr Asn 
145 150 155 160 

Lys Val val Glu ser Ala Lys Ala lie Leu Gly Gly ser Lys val Arg 

165 170 t 175 

Val Asp Gin Lys cys Lys ser ser Val Gin He Asp ser Thr Pro Val 

180 185 190 

He val Thr ser Asn Thr Asn Met cys val Val Val Asp Gly Asn ser 

195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Glu Asp Arg Met Phe Lys Phe 

210 215 220 

Glu Leu Thr Lys Arg Leu Pro Pro Asp Phe Gly Lys lie Thr Lys Gin 
225 230 235 240 

Glu val Lys Asp Phe Phe Ala Trp Ala Lys val Asn Gin val Pro val 

245 250 255 

Thr His Glu Phe Lys val Pro Arg Glu Leu Ala Gly Thr Lys Gly Ala 
260 265 270 
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Glu Lys Ser Leu Lys Arg Pro Leu Gly Asp val Thr Asn Thr Ser Tyr 

275 280 285 

lvs ser Leu Glu Lys Arg Ala Arg Leu ser Phe val pro Glu Thr Pro 

290 295 300 

Arg ser Ser Asp Val Thr Val Asp Pro Ala Pro Leu Arg Pro Leu Asn 
305 310 315 320 

Trp Asn Ser Arg Tyr Asp cys Lys cys Asp Tyr His Ala Gin Phe Asp 

325 330 335 

Asn lie Ser Asn Lys Cys Asp Glu cys Glu Tyr Leu Asn Arg Gly Lys 

340 345 350 , 

Asn Gly Cys lie cys His Asn val Thr His cys Gin lie cys His Gly 

355 360 365 

lie Pro Pro Trp Glu Lys Glu Asn Leu ser Asp Phe Gly Asp Phe Asp 

370 375 380 

Asp Ala Asn Lys Glu Gin 
385 390 

<210> 25 
<211> 594 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

Glu°His 5 Leu Pro Gly lie ser Asp Ser Phe val Asp Trp Val Thr Gly 

1 5 10 " 15 

Gin lie Trp Glu Leu Pro Pro Glu ser Asp Leu Asn Leu Thr Leu val 

20 25 30 

Glu Gin Pro Gin Leu Thr Val Ala Asp Arg lie Arg Arg Val Phe Leu 

35 40 45 

Tyr Glu Trp Asn Lys Phe Ser Lys Gin Glu Ser Lys Phe Phe val Gin 

50 55 60 

Phe Glu Lys Gly Ser Glu Tyr Phe His Leu His Thr Leu Val Glu Thr 
65 70 75 80 

Ser Gly lie Ser Ser Met val Leu Gly Arg Tyr Val Ser Gin lie Arg 

85 90 95 

Ala Gin Leu Val Lys Val Val Phe Gin Gly lie Glu Pro Gin lie Asn 

100 105 110 

Asp Trp val Ala He Thr Lys Val Lys Lys Gly Gly Ala Asn Lys Val 

115 120 125 

Val Asp ser Gly Tyr lie Pro Ala Tyr Leu Leu Pro Lys val Gin Pro 

130 135 140 

Glu Leu Gin Trp Ala Trp Thr Asn Leu Asp Glu Tyr Lys Leu Ala Ala 
145 150 155 160 

Leu Asn Leu Glu Glu Arg Lys Arg Leu Val Ala Gin Phe Leu Ala Glu 

165 170 175 

ser ser Gin Arg ser Gin Glu Ala Ala ser Gin Arg Glu Phe ser Ala 

180 185 190 

Asp Pro val lie Lys ser Lys Thr Ser Gin Lys Tyr Met Ala Leu val 

195 200 205 ^ n 

Asn Trp Leu Val Glu His Gly He Thr ser Glu Lys Gin Trp lie Gin 

210 215 220 

Glu Asn Gin Glu ser Tyr Leu Ser Phe Asn ser Thr Gly Asn ser Arg 
225 230 235 240 

Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Thr Lys He Met ser Leu 

245 250 255 

Thr Lys ser Ala Val Asp Tyr Leu val Gly Ser ser val Pro Glu Asp 

260 265 270 

lie ser Lys Asn Arg lie Trp Gin lie Phe Glu Met Asn Gly Tyr Asp 
275 280 285 
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pro Ala Tyr Ala Gly Ser lie Leu Tyr Gly Trp cys Gin Arg ser Phe 

290 " 295 300 

Asn Lys Arg Asn Thr Val Trp Leu Tyr Gly Pro Ala Thr Thr Gly Lys 
305 310 315 320 

Thr Asn He Ala Glu Ala He Ala His Thr Val Pro Phe Tyr Gly cys 

325 330 335 

val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp Cys Val Asp Lys 

340 345 350 

Met Leu lie Trp Trp Glu Glu Gly Lys Met Thr Asn Lys Val val Glu 

355 360 365 

ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg Val Asp Gin Lys 

370 375 380 

Cys Lys ser Ser Val Gin He Asp ser Thr pro Val lie val Thr Ser 
385 390 395 400 

Asn Thr Asn Met Cys Val val val Asp Gly Asn ser Thr Thr Phe Glu 

405 410 415 

His Gin Gin Pro Leu Glu Asp Arg Met Phe Lys Phe Glu Leu Thr Lys 

420 425 430 

Arq Leu Pro Pro Asp Phe Gly Lys lie Thr Lys Gin Glu Val Lys Asp 

435 440 445 

Phe Phe Ala Trp Ala Lys val Asn Gin val Pro val Thr His Glu Phe 

450 455 460 

Lys val Pro Arg Glu Leu Ala Gly Thr Lys Gly Ala Glu Lys Ser Leu 
465 470 475 480 

Lys Arg Pro Leu Gly Asp Val Thr Asn Thr Ser Tyr Lys Ser Leu Glu 

485 490 495 

Lys Arg Ala Arg Leu Ser Phe val Pro Glu Thr Pro Arg Ser Ser Asp 

500 505 510 

val Thr val Asp Pro Ala Pro Leu Arg Pro Leu Asn Trp Asn ser Arg 

515 520 525 

Tyr Asp cys Lys Cys Asp Tyr His Ala Gin Phe Asp Asn lie Ser Asn 

530 535 540 

Lys cys Asp Glu Cys Glu Tyr Leu Asn Arg Gly Lys Asn Gly cys lie 
545 550 555 560 

cys His Asn Val Thr His Cys Gin He cys His Gly lie Pro Pro Trp 

565 570 575 

Glu Lys Glu Asn Leu ser Asp Phe Gly Asp Phe Asp Asp Ala Asn Lys 
580 585 590 

Glu Gin 

<210> 26 
<2U> 724 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

Met°ser 6 phe val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 

1 5 10 15 

Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 

20 25 30 

pro Asn Gin Gin His Gin Asp Gin Ala Arg Gly Leu val Leu Pro Gly 

35 40 45 

Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro val 

50 55 60 

Asn Arg Ala Asp Glu val Ala Arg Glu His Asp He ser Tyr Asn Glu 

65 70 75 . 80 

Gin Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 

85 90 95 
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Ala Glu Phe Gin Glu Lys Leu Ala Asp Asp Thr ser Phe Gly Gly Asn 

100 105 110 

Leu Gly Lys Ala val Phe Gin Ala Lys Lys Arg val Leu Glu Pro Phe 

115 120 125 

Gly Leu val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg He 

130 135 140 

Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu /Asp ser 
145 150 155 160 

Lvs Pro ser Thr ser ser Asp Ala Glu Ala Gly Pro ser Gly Ser Gin 

165 170 175 

Gin Leu Gin lie Pro Ala Gin Pro Ala Ser Ser Leu Gly Ala Asp Thr 

180 185 190 

Met ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gin Gly Ala 

195 200 ' 205 

Asp Gly val Gly Asn Ala ser Gly Asp Trp His cys Asp Ser Thr Trp 

210 215 220 

Met Gly Asp Arg val Val Thr Lys ser Thr Arg Thr Trp Val Leu Pro 
225 230 235 240 

ser Tyr Asn Asn His Gin Tyr Arg Glu lie Lys Ser Gly ser Val Asp 

245 " 250 255 

Gly ser Asn Ala Asn Ala Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr 

260 265 270 

Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gin 

275 280 285 

Arg Leu lie Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 

290 295 300 

Lys He Phe Asn lie Gin Val Lys Glu Val Thr val Gin Asp Ser Thr 
305 310 315 320 

Thr Thr lie Ala Asn Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp 

325 330 335 

Asp Asp Tyr Gin Leu Pro Tyr val val Gly Asn Gly Thr Glu Gly cys 

340 345 n 350 

Leu Pro Ala Phe Pro Pro Gin Val Phe Thr Leu Pro Gin Tyr Gly Tyr 

355 360 365 

Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 

370 375 380 

Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
385 390 395 m 400 

Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His ser Ser 

405 410 415 

Phe Ala Pro Ser Gin Asn Leu Phe Lys Leu Ala Asn Pro Leu val Asp 

420 425 430 

Gin Tyr Leu Tyr Arg Phe val ser Thr Asn Asn Thr Gly Gly val Gin 

435 440 445 

Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 

450 455 460 

Phe Pro Gly Pro Met Gly Arg Thr Gin Gly Trp Asn Leu Gly ser Gly 
465 470 475 480 

val Asn Arg Ala Ser Val ser Ala Phe Ala Thr Thr Asn Arg Met Glu 

485 490 495 

Leu Glu Gly Ala ser Tyr Gin val Pro Pro Gin pro Asn Gly Met Thr 

500 505 510 

Asn Asn Leu Gin Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met lie 

515 520 525 

Phe Asn Ser Gin Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 

530 535 540 

Gly Asn Met Leu lie Thr ser Glu Ser Glu Thr Gin Pro val Asn Arg 
545 550 555 560 

val Ala Tyr Asn val Gly Gly Gin Met Ala Thr Asn Asn Gin Ser Ser 

565 570 575 

Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gin Glu lie val Pro 

580 585 590 

Gly ser val Trp Met Glu Arg Asp Val Tyr Leu Gin Gly Pro lie Trp 
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595 600 605 

Ala Lys lie Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 

610 615 620 

Gly Gly Phe Gly Leu Lys His Pro Pro pro Met Met Leu He Lys Asn 
625 630 635 640 

Thr Pro Val Pro Gly Asn lie Thr Ser Phe Ser Asp val Pro Val ser 

645 650 655 

Ser Phe lie Thr Gin Tyr ser Thr Gly Gin val Thr val Glu Met Glu 

660 665 670 

Trp Glu Leu Lys Lys Glu Asn ser Lys Arg Trp Asn Pro Glu He Gin 

675 680 685 

Tyr Thr Asn Asn Tyr Asn Asp Pro Gin Phe Val Asp Phe Ala Pro Asp 

690 695 700 

Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro lie Gly Thr Arg Tyr Leu 
705 710 715 720 

Thr Arg Pro Leu 

<210> 27 
<211> 588 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 27 . 

Thr Ala Pro Thr Gly Lys Arg He Asp Asp His Phe Pro Lys Arg Lys 

1 5 10 15 

Lys Ala Arg Thr Glu Glu Asp ser Lys Pro ser Thr Ser ser Asp Ala 

20 25 30 

Glu Ala Gly Pro ser Gly Ser Gin Gin Leu Gin lie Pro Ala Gin Pro 

35 40 45 

Ala ser Ser Leu Gly Ala Asp Thr Met Ser Ala Gly Gly Gly Gly Pro 

50 55 60 

Leu Gly Asp Asn Asn Gin Gly Ala Asp Gly val Gly Asn Ala Ser Gly 
65 70 75 80 

Asp Trp His cys Asp Ser Thr Trp Met Gly Asp Arg Val Val Thr Lys 

Ser Thr Arg Thr Trp Val Leu Pro ser Tyr Asn Asn His Gin Tyr Arg 

100 105 110 

Glu He Lys ser Gly Ser Val Asp Gly Ser Asn Ala Asn Ala Tyr Phe 

115 120 125 

Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His ser 

130 135 n 140 

His Trp ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Tyr Trp Gly 
145 150 155 160 

Phe Arg Pro Arg Ser Leu Arg val Lys lie Phe Asn lie Gin Val Lys 

165 170 175 

Glu val Thr val Gin Asp Ser Thr Thr Thr lie Ala Asn Asn Leu Thr 

180 185 190 

ser Thr Val Gin val Phe Thr Asp Asp Asp Tyr Gin Leu Pro Tyr val 

195 200 205 

val Gly Asn Gly Thr Glu Gly cys Leu Pro Ala Phe Pro Pro Gin val 

210 215 220 

Phe Thr Leu Pro Gin Tyr Gly Tyr Ala Thr Leu Asn Arg Asp Asn Thr 
225 230 235 240 

Glu Asn Pro Thr Glu Arg Ser ser Phe Phe Cys Leu Glu Tyr Phe Pro 

245 250 255 

Ser Lys Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Thr Tyr Asn Phe 

260 265 270 

Glu Glu Val Pro Phe His Ser ser Phe Ala Pro Ser Gin Asn Leu Phe 
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275 280 285 

Lys Leu Ala Asn Pro Leu val Asp Gin Tyr Leu Tyr Arg Phe Val Ser 

290 295 300 

Thr Asn Asn Thr Gly Gly Val Gin Phe Asn Lys Asn Leu Ala Gly Arg 
305 310 315 3£0 

Tvr Ala Asn Thr Tyr Lys Asn Trp Phe Pro Gly Pro Met Gly Arg Thr 

325 330 335 

Gin Gly Trp Asn Leu Gly ser Gly val Asn Arg Ala ser val ser Ala 

340 345 350 

Phe Ala Thr Thr Asn Arg Met Glu Leu Glu Gly Ala Ser Tyr Gin val 

355 360 365 

pro Pro Gin Pro Asn Gly Met Thr Asn Asn Leu Gin Gly ser Asn Thr 

370 375 380 

Tyr Ala Leu Glu Asn Thr Met lie Phe Asn Ser Gin Pro Ala Asn Pro 
385 390 395 400 

Gly Thr Thr Ala Thr Tyr Leu Glu Gly Asn Met Leu lie Thr ser Glu 

405 410 415 

Ser Glu Thr Gin Pro Val Asn Arg val Ala Tyr Asn Val Gly Gly (Sin 

420 425 430 

Met Ala Thr Asn Asn Gin ser Ser Thr Thr Ala Pro Ala Thr Gly Thr 

435 440 445 

Tyr Asn Leu Gin Glu lie val Pro Gly Ser val Trp Met Glu Arg Asp 

450 455 460 

Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro Glu Thr Gly Ala 
465 470 475 , 480 

His Phe His Pro ser Pro Ala Met Gly Gly Phe Gly Leu Lys His Pro 

485 490 495 

pro Pro Met Met Leu He Lys Asn Thr Pro Val Pro Gly Asn lie Thr 

500 505 „ 510 

Ser Phe Ser Asp val Pro val Ser Ser Phe lie Thr Gin Tyr Ser Thr 

515 520 525 

Gly Gin val Thr val Glu Met Glu Trp Glu Leu Lys Lys Glu Asn Ser 

530 535 540 

Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Asn Asn Tyr Asn Asp Pro 
545 550 555 560 

Gin Phe Val Asp Phe Ala Pro Asp ser Thr Gly Glu Tyr Arg Thr Thr 

565 570 575 

Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
580 ~ 585 

<210> 28 
<211> 532 
<212> PRT 

<213> Artificial Sequence 

<220> .... 
<223> Description of Artificial Sequence; note = 
synthetic construct 

Met°Ser 8 Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gin Gly Ala 

1 5 10 15 

Asp Gly val Gly Asn Ala ser Gly Asp Trp His Cys Asp ser Thr Trp 

20 25 30 

Met Gly Asp Arg val Val Thr Lys Ser Thr Arg Thr Trp val Leu Pro 

35 " 40 45 

ser Tyr Asn Asn His Gin Tyr Arg Glu lie Lys ser Gly ser val Asp 

50 55 60 

Gly ser Asn Ala Asn Ala Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr 
65 70 75 80 

Phe asp Phe Asn Arg Phe His Ser His Trp ser Pro Arg Asp Trp Gin 

85 90 95 

Arq Leu He Asn Asn Tyr Trp Gly Phe Arg Pro Arg ser Leu Arg Val 
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100 105 110 

lvs lie Phe Asn lie Gin Val Lys Glu val Thr val Gin Asp Ser Thr 

115 120 125 

Thr Thr lie Ala Asn Asn Leu Thr Ser Thr val Gin val Phe Thr Asp 

130 135 140 

Asp Asp Tyr Gin Leu Pro Tyr val val Gly Asn Gly Thr Glu Gly cys 
145 150 155 160 

Leu Pro Ala Phe Pro Pro Gin Val Phe Thr Leu Pro Gin Tyr Gly Tyr 

165 170 175 

Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arcj ser Ser 

phe Phe Cys Leu Glu Tyr phe Pro Ser Lys Met Leu Arg Thr Gly Asn 

195 200 205 

Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu val Pro Phe His ser Ser 

210 215 220 

Phe Ala Pro Ser Gin Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
225 230 235 240 

Gin Tyr Leu Tyr Arg Phe val ser Thr Asn Asn Thr Gly Gly val Gin 

245 250 255 

Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 

260 265 270 

Phe Pro Gly Pro Met Gly Arg Thr Gin Gly Trp Asn Leu Gly ser Gly 

275 280 285 

Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 

290 295 300 

Leu Glu Gly Ala Ser Tyr Gin val Pro Pro Gin Pro Asn Gly Met Thr 
305 310 315 320 

Asn Asn Leu Gin Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met He 

325 330 335 

Phe Asn Ser Gin Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 

340 345 350 

Gly Asn Met Leu He Thr ser Glu Ser Glu Thr Gin Pro Val Asn Arg 

355 360 365 

val Ala Tyr Asn Val Gly Gly Gin Met Ala Thr Asn Asn Gin Ser Ser 

370 375 380 

Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gin Glu lie Val Pro 
385 390 395 400 

Gly ser val Trp Met Glu Arg Asp val Tyr Leu Gin Gly Pro lie Trp 

405 ~ 410 415 

Ala Lys He Pro Glu Thr Gly Ala His Phe His Pro ser Pro Ala Met 

420 425 430 

Glv Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu lie Lys Asn 

435 440 445 

Thr Pro val Pro Gly Asn lie Thr Ser Phe ser Asp Val Pro val Ser 

450 455 460 

ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Thr val Glu Met Glu 
465 470 475 480 

Trp Glu Leu Lys Lys Glu Asn ser Lys Arg Trp Asn Pro Glu He Gin 

485 490 495 

Tyr Thr Asn Asn Tyr Asn Asp Pro Gin Phe val Asp Phe Ala Pro Asp 

500 505 510 

ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro He Gly Thr Arg Tyr Leu 

515 520 525 

Thr Arg Pro Leu 
530 

<210> 29 
<211> 2307 
<212> DNA 

<213> Artificial sequence 

< 22 0> . , 

<223> Description of Artificial Sequence; note = 
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synthetic construct 



<400> 29 

aggctctcat 

ctgcgaccgc 

accctccaga 

cgggcccacc 

tgcctggtta 

gggcagacga 

acaaccccta 

acacatcctt 

aaccttttgg 

accactttcc 

cagacgccga 

caagtttggg 

aaggtgccga 

gggacagagt 

agtaccgaga 

acagcacccc 

actggcaaag 

tcttcaacat 

acctcacctc 

gcaacgggac 

acggttacgc 

tctgcctaga 

acaactttga 

tggccaaccc 

gagtccagtt 

cggggcccat 

tcagcgcctt 

cgcagccgaa 

ctatgatctt 

acatgctcat 

gcgggcagat 

acctccagga 

ccatctgggc 

gattcggact 

atatcaccag 

aggtcaccgt 

agatccagta 

ccggggaata 

ccattcatgt 



ttgttcccga 
tcaattggaa 
ttggttggaa 
gaaaccaaaa 
taactatctc 
ggtcgcgcga 
cctcaagtac 
cgggggaaac 
cctggttgaa 
aaaaagaaag 
agctggaccc 
agctgataca 
tggagtgggc 
cgtcaccaag 
gatcaaaagc 
ctgggggtac 
actcatcaac 
tcaagtcaaa 
caccgtccaa 
cgagggatgc 
gacgctgaac 
gtactttccc 
ggaggtgccc 
gctggtggac 
caacaagaac 
gggccgaacc 
cgccacgacc 
cggcatgacc 
caacagccag 
caccagcgag 
ggccaccaac 
aatcgtgccc 
caagatccca 
caaacaccca 
cttctcggac 
ggagatggag 
cacaaacaac 
cagaaccacc 
cgcataccct 



gacgcctcgc 
ttcaagtaaa 
gaagttggtg 
cccaatcagc 
ggacccggaa 
gagcacgaca 
aaccacgcgg 
ctcggaaagg 
gagggtgcta 
aaggctcgga 
agcggatccc 
atgtctgcgg 
aatgcctcgg 
tccacccgaa 
ggctccgtcg 
tttgacttta 
aactactggg 
gaggtcacgg 
gtgtttacgg 
ctgccggcct 
cgcgacaaca 
agcaagatgc 
ttccactcca 
cagtacttgt 
ctggccggga 
cagggctgga 
aataggatgg 
aacaacctcc 
ccggcgaacc 
agcgagacgc 
aaccagagct 
ggcagcgtgt 
gagacggggg 
ccgcccatga 
gtgcccgtca 
tgggagctca 
tacaacgacc 
agacctatcg 
caataaa 



agttcagacg 
taaagcgagt 
aaggtcttcg 
agcatcaaga 
acggtctcga 
tctcgtacaa 
acgccgagtt 
cagtctttca 
agacggcccc 
ccgaagagga 
agcagctgca 
gaggtggcgg 
gagattggca 
cctgggtgct 
acggaagcaa 
accgcttcca 
gcttcagacc 
tgcaggactc 
acgacgacta 
tccctccgca 
cagaaaatcc 
tgagaacggg 
gcttcgctcc 
accgcttcgt 
gatacgccaa 
acctgggctc 
agctcgaggg 
agggcagcaa 
cgggcaccac 
agccggtgaa 
ccaccactgc 
ggatggagag 
cgcactttca 
tgctcatcaa 
gcagcttcat 
agaaggaaaa 
cccagtttgt 
gaacccgata 



tgactgttga 
agtcatgtct 
cgagtttttg 
tcaagcccgt 
tcgaggagag 
cgagcagctt 
tcaggagaag 
ggccaagaaa 
taccggaaag 
ctccaagcct 
aatcccagcc 
cccattgggc 
ttgcgattcc 
gcccagctac 
cgccaacgcc 
cagccactgg 
ccggtccctc 
caccaccacc 
ccagctgccc 
ggtctttacg 
caccgagagg 
caacaacttt 
cagtcagaac 
gagcacaaat 
cacctacaaa 
cggggtcaac 
cgcgagttac 
cacctatgcc 
cgccacgtac 
ccgcgtggcg 
ccccgcgacc 
ggacgtgtac 
cccctctccg 
gaacacgcct 
cacccagtac 
ctccaagagg 
ggactttgcc 
ccttacccga 



tcccgctcct 
tttgttgatc 
ggccttgaag 
ggtcttgtgc 
cctgtcaaca 
gaggcgggag 
ctcgccgacg 
agggttctcg 
cggatagacg 
tccacctcgt 
caaccagcct 
gacaataacc 
acgtggatgg 
aacaaccacc 
tactttggat 
agcccccgag 
agagtcaaaa 
atcgccaaca 
tacgtcgtcg 
ctgccgcagt 
agcagcttct 
gagtttacct 
ctgttcaagc 
aacactggcg 
aactggttcc 
cgcgccagtg 
caggtgcccc 
ctggagaaca 
ctcgagggca 
tacaacgtcg 
ggcacgtaca 
ctccaaggac 
gccatgggcg 
gtgcccggaa 
agcaccgggc 
tggaacccag 
ccggacagca 
cccctttaac 



<210> 30 
<211> 2264 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 



<400> 30 

aggctctcat 

ctgcgaccgc 

gtttttgggc 

agcccgtggt 

aggagagcct 

gcagcttgag 

ggagaagctc 

caagaaaagg 

cggaaagcgg 

caagccttcc 



ttgttcccga 
tcaattggaa 
cttgaagcgg 
cttgtgctgc 
gtcaacaggg 
gcgggagaca 
gccgacgaca 
gttctcgaac 
atagacgacc 
acctcgtcag 



gacgcctcgc 
ttcaagattg 
gcccaccgaa 
ctggttataa 
cagacgaggt 
acccctacct 
catccttcgg 
cttttggcct 
actttccaaa 
acgccgaagc 



agttcagacg 
gttggaagaa 
accaaaaccc 
ctatctcgga 
cgcgcgagag 
caagtacaac 
gggaaacctc 
ggttgaagag 
aagaaagaag 
tggacccagc 



tgactgttga 
gttggtgaag 
aatcagcagc 
cccggaaacg 
cacgacatct 
cacgcggacg 
ggaaaggcag 
ggtgctaaga 
gctcggaccg 
ggatcccagc 



tcccgctcct 
gtcttcgcga 
atcaagatca 
gtctcgatcg 
cgtacaacga 
ccgagtttca 
tctttcaggc 
cggcccctac 
aagaggactc 
agctgcaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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720 
780 
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900 
960 
1020 
1080 
1140 
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1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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2040 
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2160 
2220 
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2307 



60 
120 
180 
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300 
360 
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600 



32/60 



WO 2006/029196 



PCT/US2005/031837 



cccagcccaa 
attgggcgac 
cgattccacg 
cagctacaac 
caacgcctac 
ccactggagc 
gtccctcaga 
caccaccatc 
gctgccctac 
ctttacgctg 
cgagaggagc 
caactttgag 
tcagaacctg 
cacaaataac 
ctacaaaaac 
ggtcaaccgc 
gagttaccag 
ctatgccctg 
cacgtacctc 
cgtggcgtac 
cgcgaccggc 
cgtgtacctc 
ctctccggcc 
cacgcctgtg 
ccagtacagc 
caagaggtgg 
ctttgccccg 
tacccgaccc 



ccagcctcaa 
aataaccaag 
tggatggggg 
aaccaccagt 
tttggataca 
ccccgagact 
gtcaaaatct 
gccaacaacc 
gtcgtcggca 
ccgcagtacg 
agcttcttct 
tttacctaca 
ttcaagctgg 
actggcggag 
tggttcccgg 
gccagtgtca 
gtgcccccgc 
gagaacacta 
gagggcaaca 
aacgtcggcg 
acgtacaacc 
caaggaccca 
atgggcggat 
cccggaaata 
accgggcagg 
aacccagaga 
gacagcaccg 
ctttaaccca 



gtttgggagc 
gtgccgatgg 
acagagtcgt 
accgagagat 
gcaccccctg 
ggcaaagact 
tcaacattca 
tcacctccac 
acgggaccga 
gttacgcgac 
gcctagagta 
actttgagga 
ccaacccgct 
tccagttcaa 
ggcccatggg 
gcgccttcgc 
agccgaacgg 
tgatcttcaa 
tgctcatcac 
ggcagatggc 
tccaggaaat 
tctgggccaa 
tcggactcaa 
tcaccagctt 
tcaccgtgga 
tccagtacac 
gggaatacag 
ttcatgtcgc 



tgatacaatg 
agtgggcaat 
caccaagtcc 
caaaagcggc 
ggggtacttt 
catcaacaac 
agtcaaagag 
cgtccaagtg 
gggatgcctg 
gctgaaccgc 
ctttcccagc 
ggtgcccttc 
ggtggaccag 
caagaacctg 
ccgaacccag 
cacgaccaat 
catgaccaac 
cagccagccg 
cagcgagagc 
caccaacaac 
cgtgcccggc 
gatcccagag 
acacccaccg 
ctcggacgtg 
gatggagtgg 
aaacaactac 
aaccaccaga 
ataccctcaa 



tctgcgggag 
gcctcgggag 
acccgaacct 
tccgtcgacg 
gactttaacc 
tactggggct 
gtcacggtgc 
tttacggacg 
ccggccttcc 
gacaacacag 
aagatgctga 
cactccagct 
tacttgtacc 
gccgggagat 
ggctggaacc 
aggatggagc 
aacctccagg 
gcgaacccgg 
gagacgcagc 
cagagctcca 
agcgtgtgga 
acgggggcgc 
cccatgatgc 
cccgtcagca 
gagctcaaga 
aacgaccccc 
cctatcggaa 
taaa 



gtggcggccc 
attggcattg 
gggtgctgcc 
gaagcaacgc 
gcttccacag 
tcagaccccg 
aggactccac 
acgactacca 
ctccgcaggt 
aaaatcccac 
gaacgggcaa 
tcgctcccag 
gcttcgtgag 
acgccaacac 
tgggctccgg 
tcgagggcgc 
gcagcaacac 
gcaccaccgc 
cggtgaaccg 
ccactgcccc 
tggagaggga 
actttcaccc 
tcatcaagaa 
gcttcatcac 
aggaaaactc 
agtttgtgga 
cccgatacct 



<210> 31 
<211> 2264 
<212> DNA 
<213> Artificial 



Sequence 



< 22 0> . . . 

<223> Description of Artificial Sequence; note = 

synthetic construct 



<400> 31 

aggctctcat 

ctgcgaccgc 

gtttttgggc 

agcccgtggt 

aggagagcct 

gcagcttgag 

ggagaagctc 

caagaaaagg 

cggaaagcgg 

caagccttcc 

cccagcccaa 

attgggcgac 

cgattccacg 

cagctacaac 

caacgcctac 

ccactggagc 

gtccctcaga 

caccaccatc 

gctgccctac 

ctttacgctg 

cgagaggagc 

caactttgag 

tcagaacctg 

cacaaataac 



ttgttcccga 
tcaattggaa 
cttgaagcgg 
cttgtgctgc 
gtcaacaggg 
gcgggagaca 
gccgacgaca 
gttctcgaac 
atagacgacc 
acctcgtcag 
ccagcctcaa 
aataaccaag 
tggatggggg 
aaccaccagt 
tttggataca 
ccccgagact 
gtcaaaatct 
gccaacaacc 
gtcgtcggca 
ccgcagtacg 
agcttcttct 
tttacctaca 
ttcaagctgg 
actggcggag 



gacgcctcgc 
ttcaagattg 
gcccaccgaa 
ctggttataa 
cagacgaggt 
acccctacct 
catccttcgg 
cttttggcct 
actttccaaa 
acgccgaagc 
gtttgggagc 
gtgccgatgg 
acagagtcgt 
accgagagat 
gcaccccctg 
ggcaaagact 
tcaacattca 
tcacctccac 
acgggaccga 
gttacgcgac 
gcctagagta 
actttgagga 
ccaacccgct 
tccagttcaa 



agttcagacg 
gttggaagaa 
accaaaaccc 
ctatctcgga 
cgcgcgagag 
caagtacaac 
gggaaacctc 
ggttgaagag 
aagaaagaag 
tggacccagc 
tgatacaatg 
agtgggcaat 
caccaagtcc 
caaaagcggc 
ggggtacttt 
catcaacaac 
agtcaaagag 
cgtccaagtg 
gggatgcctg 
gctgaaccgc 
ctttcccagc 
ggtgcccttc 
ggtggaccag 
caagaacctg 



tgactgttga 
gttggtgaag 
aatcagcagc 
cccggaaacg 
cacgacatct 
cacgcggacg 
ggaaaggcag 
ggtgctaaga 
gctcggaccg 
ggatcccagc 
tctgcgggag 
gcctcgggag 
acccgaacct 
tccgtcgacg 
gactttaacc 
tactggggct 
gtcacggtgc 
tttacggacg 
ccggccttcc 
gacaacacag 
aagatgctga 
cactccagct 
tacttgtacc 
qccgggagat 



tcccgctcct 
gtcttcgcga 
atcaagatca 
gtctcgatcg 
cgtacaacga 
ccgagtttca 
tctttcaggc 
cggcccctac 
aagaggactc 
agctgcaaat 
gtggcggccc 
attggcattg 
gggtgctgcc 
gaagcaacgc 
gcttccacag 
tcagaccccg 
aggactccac 
acgactacca 
ctccgcaggt 
aaaatcccac 
gaacgggcaa 
tcgctcccag 
gcttcgtgag 
acgccaacac 
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ctacaaaaac 
ggtcaaccgc 
gagttaccag 
ctatgccctg 
cacgtacctc 
cgtggcgtac 
cgcgaccggc 
cgtgtacctc 
ctctccggcc 
cacgcctgtg 
ccagtacagc 
caagaggtgg 
ctttgccccg 
tacccgaccc 



tggttcccgg 
gccagtgtca 
gtgcccccgc 
gagaacacta 
gagggcaaca 
aacgtcggcg 
acgtacaacc 
caaggaccca 
atgggcggat 
cccggaaata 
accgggcagg 
aacccagaga 
gacagcaccg 
ctttaaccca 



ggcccatggg 
gcgccttcgc 
agccgaacgg 
tgatcttcaa 
tgctcatcac 
ggcagatggc 
tccaggaaat 
tctgggccaa 
tcggactcaa 
tcaccagctt 
tcaccgtgga 
tccagtacac 
gggaatacag 
ttcatgtcgc 



ccgaacccag 
cacgaccaat 
catgaccaac 
cagccagccg 
cagcgagagc 
caccaacaac 
cgtgcccggc 
gatcccagag 
acacccaccg 
ctcggacgtg 
gatggagtgg 
aaacaactac 
aaccaccaga 
ataccctcaa 



ggctggaacc 
aggatggagc 
aacctccagg 
gcgaacccgg 
gagacgcagc 
cagagctcca 
agcgtgtgga 
acgggggcgc 
cccatgatgc 
cccgtcagca 
gagctcaaga 
aacgaccccc 
cctatcggaa 
taaa 



tgggctccgg 
tcgagggcgc 
gcagcaacac 
gcaccaccgc 
cggtgaaccg 
ccactgcccc 
tggagaggga 
actttcaccc 
tcatcaagaa 
gcttcatcac 
aggaaaactc 
agtttgtgga 
cccgatacct 



<210> 32 
<2U> 1292 
<212> DNA 

<213> Artificial Sequence 

< 220> • 

<223> Description of Artificial Sequence; note = 

synthetic construct 



<400> 32 

agcgcaaacg 

cttcgcagcg 

tggcgctcgt 

aaaatcagga 

ccgcgctcga 

tggggagctc 

atggctacga 

acaagaggaa 

aggccatcgc 

cctttaatga 

aggtggttga 

gtaaatcctc 

gtgtggtggt 

tgttcaaatt 

aagtcaagga 

aagttcccag 

gtgacgtcac 

ccgagacgcc 

ggaattcaag 

aatgtgatga 

ctcactgtca 

gggattttga 



gctcgtcgcg 
tgagttctcg 
caactggctc 
gagctacctc 
caacgcgacc 
cgttcccgag 
cccggcctac 
caccgtctgg 
ccacactgtg 
ctgtgtggac 
atccgccaag 
tgttcaaatt 
ggatgggaat 
tgaactgact 
cttttttgct 
ggaattggcg 
caatactagc 
tcgcagttca 
gtatgattgc 
atgtgaatat 
aatttgtcat 
cgatgccaat 



cagtttctgg 
gctgacccgg 
gtggagcacg 
tccttcaact 
aaaattatga 
gacatttcaa 
gcgggatcca 
ctctacggac 
cccttttacg 
aaaatgctca 
gccatcctgg 
gattctaccc 
tccacgacct 
aagcggctcc 
tgggcaaagg 
ggaactaaag 
tataaaagtc 
gacgtgactg 
aaatgtgact 
ttgaatcggg 
gggattcccc 
aaagaacagt 



cagaatcctc 
tcatcaaaag 
gcatcacttc 
ccaccggcaa 
gtctgacaaa 
aaaacagaat 
tcctctacgg 
ccgccacgac 
gctgcgtgaa 
tttggtggga 
ggggctcaaa 
ctgtcattgt 
ttgaacacca 
cgccagattt 
tcaatcaggt 
gggcggagaa 
tggagaagcg 
ttgatcccgc 
atcatgctca 
gcaaaaatgg 
cctgggaaaa 
aa 



gcagcgctcg 
caagacttcc 
cgagaagcag 
ctctcggagc 
aagcgcggtg 
ctggcaaatt 
ctggtgtcag 
cggcaagacc 
ctggaccaat 
ggagggaaag 
ggtgcgggtc 
aacttccaat 
gcagccgctg 
tggcaagatt 
gccggtgact 
atctctaaaa 
ggccaggctc 
tcctctgcga 
atttgacaac 
atgtatctgt 
ggaaaacttg 



caggaggcgg 
cagaaataca 
tggatccagg 
cagatcaagg 
gactacctcg 
tttgagatga 
cgctccttca 
aacatcgcgg 
gaaaactttc 
atgaccaaca 
gatcagaaat 
acaaacatgt 
gaggaccgca 
actaagcagg 
cacgagttta 
cgcccactgg 
tcatttgttc 
ccgctcaatt 
atttctaaca 
cacaatgtaa 
tcagattttg 



<210> 33 
<211> 1870 
<212> DNA 

<213> Artificial Sequence 

< 22 0> . , 

<223> Description of Artificial sequence; note = 
synthetic construct 



<400> 33 

attctttgct 

tgttcgcgtc 

ctgggtaact 

tgaacagcct 

caaattttcc 



ctggactgct 
ccatttgacg 
ggtcaaattt 
cagttgacgg 
aagcaggagt 



agaggaccct 
tggaggaaca 
gggagctgcc 
tggctgatag 
ccaaattctt 



cgctgccatg 
tctgcctgga 
tccagagtca 
aattcgccgc 
tgtgcagttt 



gctaccttct 
atttctgaca 
gatttaaatt 
gtgttcctgt 
gaaaagggat 



atgaagtcat 
gctttgtgga 
tgactctggt 
acgagtggaa 
ctgaatattt 



1500 
1560 
1620 
1680 
1740 
1800 
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1920 
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2040 
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2160 
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tcatctgcac 
gagtcagatt 
cgactgggtc 
gtatattccc 
cctggacgag 
gtttctggca 
tgacccggtc 
ggagcacggc 
cttcaactcc 
aattatgagt 
catttcaaaa 
gggatccatc 
ctacggaccc 
cttttacggc 
aatgctcatt 
catcctgggg 
ttctacccct 
cacgaccttt 
gcggctcccg 
ggcaaaggtc 
aactaaaggg 
taaaagtctg 
cgtgactgtt 
atgtgactat 
gaatcggggc 
gattcccccc 
agaacagtaa 



acgcttgtgg 
cgcgcccagc 
gccatcacca 
gcctacctgc 
tataaattgg 
gaatcctcgc 
atcaaaagca 
atcacttccg 
accggcaact 
ctgacaaaaa 
aacagaatct 
ctctacggct 
gccacgaccg 
tgcgtgaact 
tggtgggagg 
ggctcaaagg 
gtcattgtaa 
gaacaccagc 
ccagattttg 
aatcaggtgc 
gcggagaaat 
gagaagcggg 
gatcccgctc 
catgctcaat 
aaaaatggat 
tgggaaaagg 



agacctccgg 
tggtgaaagt 
aggtaaagaa 
tgccgaaggt 
ccgccctgaa 
agcgctcgca 
agacttccca 
agaagcagtg 
ctcggagcca 
gcgcggtgga 
ggcaaatttt 
ggtgtcagcg 
gcaagaccaa 
ggaccaatga 
agggaaagat 
tgcgggtcga 
cttccaatac 
agccgctgga 
gcaagattac 
cggtgactca 
ctctaaaacg 
ccaggctctc 
ctctgcgacc 
ttgacaacat 
gtatctgtca 
aaaacttgtc 



caxcrccTCC 
ggtcttccag 
gggcggagcc 
ccaaccggag 
tctggaggag 
ggaggcggct 
gaaatacatg 
gatccaggaa 
gatcaaggcc 
ctacctcgtg 
tgagatgaat 
ctccttcaac 
catcgcggag 
aaactttccc 
gaccaacaag 
tcagaaatgt 
aaacatgtgt 
ggaccgcatg 
taagcaggaa 
cgagtttaaa 
cccactgggt 
atttgttccc 
gctcaattgg 
ttctaacaaa 
caatgtaact 
agattttggg 



axggtcctcg 
ggaattgaac 
aataaggtgg 
cttcagtggg 
cgcaaacggc 
tcgcagcgtg 
gcgctcgtca 
aatcaggaga 
gcgctcgaca 
gggagctccg 
ggctacgacc 
aagaggaaca 
gccatcgccc 
tttaatgact 
gtggttgaat 
aaatcctctg 
gtggtggtgg 
ttcaaatttg 
gtcaaggact 
gttcccaggg 
gacgtcacca 
gagacgcctc 
aattcaaggt 
tgtgatgaat 
cactgtcaaa 
gattttgacg 



gccgctacgt 
cccagatcaa 
tggattctgg 
cgtggacaaa 
tcgtcgcgca 
agttctcggc 
actggctcgt 
gctacctctc 
acgcgaccaa 
ttcccgagga 
cggcctacgc 
ccgtctggct 
acactgtgcc 
gtgtggacaa 
ccgccaaggc 
ttcaaattga 
atgggaattc 
aactgactaa 
tttttgcttg 
aattggcggg 
atactagcta 
gcagttcaga 
atgattgcaa 
gtgaatattt 
tttgtcatgg 
atgccaataa 



<210> 34 
<211> 330 
<212> PRT 

<213> Artificial sequence 

<220> . „ 

<223> Description of Artificial 
synthetic construct 

<400> 34 

Met Ala Leu val Asn Trp Leu Val 

1 5 
Gin Trp lie Gin Glu Asn Gin Glu 
20 

Gly Asn ser Arg ser Gin lie Lys 
35 40 
He Met ser Leu Thr Lys ser Ala 

50 55 
val Pro Glu Asp lie Ser Lys Asn 

65 7 ? 
Asn Gly Tyr Asp Pro Ala Tyr Ala 
85 

Gin Arg Ser Phe Asn Lys Arg Asn 
100 

Thr Thr Gly Lys Thr Asn lie Ala 
115 120 
Phe Tyr Gly cys Val Asn Trp Thr 

130 135 
cys val Asp Lys Met Leu lie Trp 
145 150 
Lys val val Glu ser Ala Lys Ala 
165 

Val Asp Gin Lys Cys Lys Ser ser 
180 

lie Val Thr Ser Asn Thr Asn Met 



Sequence; note = 



Glu His 
10 

Ser Tyr 
25 

Ala Ala 
val Asp 
Arg lie 



Gly lie Thr 

Leu Ser Phe 

Leu Asp Asn 
45 

Tyr Leu val 
60 

Trp Gin lie 
75 

lie Leu Tyr 



Gly Ser 
90 

Thr Val Trp Leu Tyr 
105 

Glu Ala 



Asn Glu 

Trp Glu 

lie Leu 
170 
val Gin 
185 

Cys Val 



lie Ala His 
125 
Asn Phe Pro 

140 
Glu Gly Lys 
155 

Gly Gly ser 
lie Asp Ser 
val val Asp 



Ser Glu Lys 
15 

Asn Ser Thr 
30 

Ala Thr Lys 

Gly ser Ser 

Phe Glu Met 
80 

Gly Trp cys 
95 

Gly Pro Ala 
110 

Thr Val Pro 

Phe Asn Asp 

Met Thr Asn 
160 

Lys val Arg 

175 
Thr Pro val 
190 

Gly Asn ser 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1870 
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195 

Thr Thr Phe 

210 
Glu Leu Thr 
225 

Glu val Lys 

Thr His Glu 

Glu Lys ser 
275 

Lys ser Leu 

290 
Arg ser Ser 
305 

Trp Asn Ser 



Glu His 

Lys Arg 

Asp Phe 
245 
Phe Lys 
260 

Leu Lys 

Glu Lys 

Asp Val 

Arg Leu 
325 



Gin Gin 
215 
Leu Pro 
230 

Phe Ala 

Val Pro 

Arg Pro 

Arg Ala 
295 
Thr val 
310 

val Gly 



200 

pro Leu 

pro Asp 

Trp Ala 

Arg Glu 
265 
Leu Gly 
280 

Arg Leu 
Asp Pro 
Arg Ser 



Glu Asp Arg 
220 

Phe Gly Lys 

235 
Lys val Asn 
250 

Leu Ala Gly 

Asp val Thr 

ser Phe val 
300 

Ala Pro Leu 
315 

Trp 
330 



205 

Met Phe 

lie Thr 

Gin val 

Thr Lys 
270 
Asn Thr 
285 

Pro Glu 
Arg Pro 



Lys Phe 

Lys Gin 
240 
Pro val 
255 

Gly Ala 

ser Tyr 

Thr Pro 

Leu Asn 
320 



<210> 35 
<211> 1115 
<212> DNA 
<213> Artificial 



Sequence 



<220> . „ 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 35 

aggagcgcaa 

cggcttcgca 

acatggcgct 

aggaaaatca 

aggccgcgct 

tcgtggggag 

tgaatggcta 

tcaacaagag 

cggaggccat 

ttccctttaa 

acaaggtggt 

aatgtaaatc 

tgtgtgtggt 

gcatgttcaa 

aggaagtcaa 

ttaaagttcc 

tgggtgacgt 

ttcccgagac 

attggaattc 



acggctcgtc 
gcgtgagttc 
cgtcaactgg 
ggagagctac 
cgacaacgcg 
ctccgttccc 
cgacccggcc 
gaacaccgtc 
cgcccacact 
tgactgtgtg 
tgaatccgcc 
ctctgttcaa 
ggtggatggg 
atttgaactg 
ggactttttt 
cagggaattg 
caccaatact 
gcctcgcagt 
aagattggtt 



gcgcagtttc 
tcggctgacc 
ctcgtggagc 
ctctccttca 
accaaaatta 
gaggacattt 
tacgcgggat 
tggctctacg 
gtgccctttt 
gacaaaatgc 
aaggccatcc 
attgattcta 
aattcgacga 
actaagcggc 
gcttgggcaa 
gcgggaacta 
agctataaaa 
tcagacgtga 
ggaagaagtt 



tggcagaatc 
cggtcatcaa 
acggcatcac 
actccaccgg 
tgagtctgac 
caaaaaacag 
ccatcctcta 
gacccgccac 
acggctgcgt 
tcatttggtg 
tggggggctc 
cccctgtcat 
cctttgaaca 
tcccgccaga 
aggtcaatca 
aaggggcgga 
gtctggagaa 
ctgttgatcc 
ggtga 



ctcgcagcgc 
aagcaagact 
ttccgagaag 
caactctcgg 
aaaaagcgcg 
aatctggcaa 
cggctggtgt 
gaccggcaag 
gaactggacc 
ggaggaggga 
aaaggtgcgg 
tgtaacttcc 
ccagcagccg 
ttttggcaag 
ggtgccggtg 
gaaatctcta 
gcgggccagg 
cgctcctctg 



tcgcaggagg 
tcccagaaat 
cagtggatcc 
agccagatca 
gtggactacc 
atttttgaga 
cagcgctcct 
accaacatcg 
aatgaaaact 
aagatgacca 
gtcgatcaga 
aatacaaaca 
ctggaggacc 
attactaagc 
actcacgagt 
aaacgcccac 
ctctcatttg 
cgaccgctca 



<210> 36 
<211> 550 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

Met°Ala 6 Thr Phe Tyr Glu val lie val Arg val Pro Phe Asp val Glu 

1 5 10 15 

Glu His Leu Pro Gly lie ser Asp Ser Phe val Asp Trp Val Thr Gly 

20 25 30 

Gin lie Trp Glu Leu Pro Pro Glu Ser Asp Leu Asn Leu Thr Leu Val 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1115 
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35 40 45 

Glu Gin Pro Gin Leu Thr val Ala Asp Arg lie Arg Arg val Phe Leu 

50 55 60 

Tyr Glu Trp Asn Lys Phe Ser Lys Gin Glu Ser Lys Phe Phe val Gin 
65 70 75 

Phe Glu Lys Gly Ser Glu Tyr Phe His Leu His Thr Leu Val Glu Thr 

85 90 ^ 95 

ser Gly He ser Ser Met Val Leu Gly Arg Tyr Val Ser Gin lie Arg 

100 105 110 

Ala Gin Leu Val Lys Val Val Phe Gin Gly He Glu Pro Gin He Asn 

115 " 120 125 

Asp Trp val Ala lie Thr Lys Val Lys Lys Gly Gly Ala Asn Lys val 

130 135 140 

val Asp Ser Gly Tyr lie Pro Ala Tyr Leu Leu Pro Lys Val Gin Pro 
145 150 155 160 

Glu Leu Gin Trp Ala Trp Thr Asn Leu Asp Glu Tyr Lys Leu Ala Ala 

165 170 175 

Leu Asn Leu Glu Glu Arg Lys Arg Leu val Ala Gin Phe Leu Ala Glu 

180 185 n 1?0 

ser ser Gin Arg Ser Gin Glu Ala Ala ser Gin Arg Glu Phe ser Ala 

195 200 205 

Asp Pro Val lie Lys Ser Lys Thr Ser Gin Lys Tyr Met Ala Leu Val 

210 215 220 

Asn Trp Leu val Glu His Gly He Thr ser Glu Lys Gin Trp He Gin 
225 230 235 240 

Glu Asn Gin Glu ser Tyr Leu Ser Phe Asn Ser Thr Gly Asn ser Arg 

245 250 255 

ser Gin He Lys Ala Ala Leu Asp Asn Ala Thr Lys lie Met Ser Leu 

260 265 n 270 

Thr Lys ser Ala Val Asp Tyr Leu Val Gly ser Ser val Pro Glu Asp 

275 280 285 

lie Ser Lys Asn Arg lie Trp Gin He Phe Glu Met Asn Gly Tyr Asp 

290 295 300 

pro Ala Tyr Ala Gly Ser He Leu Tyr Gly Trp cys Gin Arg ser Phe 
305 310 315 320 

Asn Lys Arg Asn Thr val Trp Leu Tyr Gly Pro Ala Thr Thr Gly Lys 

325 330 335 

Thr Asn He Ala Glu Ala He Ala His Thr val Pro Phe Tyr Gly Cys 

340 345 350 

val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp Cys val Asp Lys 

355 360 365 

Met Leu He Trp Trp Glu Glu Gly Lys Met Thr Asn Lys val val Glu 

370 375 , 380 

ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg Val Asp Gin Lys 
385 390 395 400 

cys Lys ser ser Val Gin He Asp ser Thr Pro Val He val Thr ser 

405 410 415 

Asn Thr Asn Met Cys Val Val val Asp Gly Asn Ser Thr Thr Phe Glu 

420 425 430 

His Gin Gin Pro Leu Glu Asp Arg Met Phe Lys Phe Glu Leu Thr Lys 

435 440 445 

Arg Leu Pro Pro Asp Phe Gly Lys He Thr Lys Gin Glu Val Lys Asp 

450 455 460 

Phe Phe Ala Trp Ala Lys Val Asn Gin Val Pro val Thr His Glu Phe 
465 470 475 480 

Lys Val Pro Arg Glu Leu Ala Gly Thr Lys Gly Ala Glu Lys ser Leu 

485 490 495 

Lys Arg Pro Leu Gly Asp val Thr Asn Thr Ser Tyr Lys ser Leu Glu 

Lys Arg Ala Arg Leu ser Phe Val Pro Glu Thr Pro Arg Ser Ser Asp 

515 520 525 

val Thr Val Asp Pro Ala Pro Leu Arg Pro Leu Asn Trp Asn ser Arg 
530 535 540 
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Leu val Gly Arg ser Trp 
545 550 

<210> 37 
<211> 1690 
<212> DNA 

<213> Artificial sequence 

<220> ..... 
<223> Description of Artificial sequence; note = 
synthetic construct 



<400> 37 

attctttgct 

tgttcgcgtc 

ctgggtaact 

tgaacagcct 

caaattttcc 

tcatctgcac 

gagtcagatt 

cgactgggtc 

gtatattccc 

cctggacgag 

gtttctggca 

tgacccggtc 

ggagcacggc 

cttcaactcc 

aattatgagt 

catttcaaaa 

gggatccatc 

ctacggaccc 

cttttacggc 

aatgctcatt 

catcctgggg 

ttctacccct 

cacgaccttt 

gcggctcccg 

ggcaaaggtc 

aactaaaggg 

taaaagtctg 

cgtgactgtt 

aagttggtga 



ctggactgct 
ccatttgacg 
ggtcaaattt 
cagttgacgg 
aagcaggagt 
acgcttgtgg 
cgcgcccagc 
gccatcacca 
gcctacctgc 
tataaattgg 
gaatcctcgc 
atcaaaagca 
atcacttccg 
accggcaact 
ctgacaaaaa 
aacagaatct 
ctctacggct 
gccacgaccg 
tgcgtgaact 
tggtgggagg 
ggctcaaagg 
gtcattgtaa 
gaacaccagc 
ccagattttg 
aatcaggtgc 
gcggagaaat 
gagaagcggg 
gatcccgctc 



agaggaccct 
tggaggaaca 
gggagctgcc 
tggctgatag 
ccaaattctt 
agacctccgg 
tggtgaaagt 
aggtaaagaa 
tgccgaaggt 
ccgccctgaa 
agcgctcgca 
agacttccca 
agaagcagtg 
ctcggagcca 
gcgcggtgga 
ggcaaatttt 
ggtgtcagcg 
gcaagaccaa 
ggaccaatga 
agggaaagat 
tgcgggtcga 
cttccaatac 
agccgctgga 
gcaagattac 
cggtgactca 
ctctaaaacg 
ccaggctctc 
ctctgcgacc 



cgctgccatg 
tctgcctgga 
tccagagtca 
aattcgccgc 
tgtgcagttt 
catctcttcc 
ggtcttccag 
gggcggagcc 
ccaaccggag 
tctggaggag 
ggaggcggct 
gaaatacatg 
gatccaggaa 
gatcaaggcc 
ctacctcgtg 
tgagatgaat 
ctccttcaac 
catcgcggag 
aaactttccc 
gaccaacaag 
tcagaaatgt 
aaacatgtgt 
ggaccgcatg 
taagcaggaa 
cgagtttaaa 
cccactgggt 
atttgttccc 
gctcaattgg 



gctaccttct 
atttctgaca 
gatttaaatt 
gtgttcctgt 
gaaaagggat 
atggtcctcg 
ggaattgaac 
aataaggtgg 
cttcagtggg 
cgcaaacggc 
tcgcagcgtg 
gcgctcgtca 
aatcaggaga 
gcgctcgaca 
gggagctccg 
ggctacgacc 
aagaggaaca 
gccatcgccc 
tttaatgact 
gtggttgaat 
aaatcctctg 
gtggtggtgg 
ttcaaatttg 
gtcaaggact 
gttcccaggg 
gacgtcacca 
gagacgcctc 
aattcaagat 



atgaagtcat 
gctttgtgga 
tgactctggt 
acgagtggaa 
ctgaatattt 
gccgctacgt 
cccagatcaa 
tggattctgg 
cgtggacaaa 
tcgtcgcgca 
agttctcggc 
actggctcgt 
gctacctctc 
acgcgaccaa 
ttcccgagga 
cggcctacgc 
ccgtctggct 
acactgtgcc 
gtgtggacaa 
ccgccaaggc 
ttcaaattga 
atgggaattc 
aactgactaa 
tttttgcttg 
aattggcggg 
atactagcta 
gcagttcaga 
tggttggaag 



<210> 38 
<211> 145 
<212> DNA 

<213> Artificial Sequence 

< 22 0> .... 

<223> Description of Artificial sequence; note = 

synthetic construct 
<400> 38 

ccatcaccaa ggtaaagaag ggcggagcca ataaggtggt ggattctggg tatattcccg 
cctacctgct gccgaaggtc caaccggagc ttcagtgggc gtggacaaac ctggacgagt 
ataaattggc cgccctgaat ctgga 

<210> 39 
<211> 174 
<212> DNA 

<213> Artificial Sequence 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1690 



60 
120 
145 



<220> 
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<223> Description of Artificial Sequence; noxe = 
synthetic construct 

taagcaggaa gtcaaggact tttttgcttg ggcaaaggtc aatcaggtgc cggtgactca 60 
cqagtttaaa gttcccaggg aattggcggg aactaaaggg gcggagaaat ctctaaaacg 120 
cccactgggt gacgtcacca atactagcta taaaagtctg gagaagcggg ccag 174 

<210> 40 
<211> 187 
<212> DNA 

<213> Artificial Sequence 

<220> _ . n 

<223> Description of Artificial sequence; note = 
synthetic construct 

cactctcaag caagggggtt ttgtaagcag tgatgtcata atgatgtaat gcttattgtc 60 

acgcgatagt taatgattaa cagtcatgtg atgtgtttta tccaatagga agaaagcgcg 120 

cgtatgagtt ctcgcgagac ttccggggta taaaagaccg agtgaacgag cccgccgcca 180 

ttctttg ±5/ 

<210> 41 
<211> 168 
<212> DNA 

<213> Artificial sequence 

< 220> . n 

<223> Description of Artificial sequence; note = 

synthetic construct 

aaacctcctt gcttgagagt gtggcactct cccccctgtc gcgttcgctc gctcgctggc 60 
tcgtttgggg gggtggcagc tcaaagagct gccagacgac ggccctctgg ccgtcgcccc 120 
cccaaacgag ccagcgagcg agcgaacgcg acagggggga gagtgcca xt* 

<210> 42 
<211> 168 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence; note = 
synthetic construct 

aaa?ct^tt gcttgagagt gtggcactct cccccctgtc gcgttcgctc ^tcgctggc 60 
tcgtttgggg gggcgacggc cagagggccg tcgtctgccg gctctttgag ctgccacccc 120 
cccaaacgag ccagcgagcg agcgaacgcg acagggggga gagtgcca J-oo 

<210> 43 
<211> 8 
<212> DNA 

<213> Artificial Sequence 



<220> „ . , 

<223> Description of Artificial Sequence; note 
synthetic construct 



<400> 43 
cggtgtga 
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<210> 44 
<211> 8 
<212> DNA 

<213> Artificial Sequence 

<220> ^ BjfS . . _ 

<223> Description of Artificial Sequence; note 
synthetic construct 

<400> 44 
cggttgag 

<210> 45 
<211> 21 
<212> DNA 
<213> Artificial 

< 22 0> . 

<223> Description of Artificial sequence; note = 

synthetic construct 
<400> 45 

caaaacctcc ttgcttgaga g 

<210> 46 
<211> 4675 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence; note = 
synthetic construct 



Sequence 



<400> 46 

ttggccactc 

cgacgcccgg 

gccaactcca 

ggttagggag 

gtggtcacgc 

ggtttgaacg 

accttgacgg 

aatgggagtt 

ccgtggccga 

cggaggccct 

tcgtggaaac 

aaaaactgat 

tcacaaagac 

ccaattactt 

agtatttaag 

cgcacgtgtc 

cggtgatcag 

aggggattac 

atgcggcctc 

tgagcctgac 

ccagcaatcg 

ccgtctttct 

ggcctgcaac 

acgggtgcgt 

tgatctggtg 

tcggaggaag 

ctcccgtgat 

ccttcgaaca 

tggatcatga 



cctctctgcg 
gctttgcccg 
tcactagggg 
gtcctgtatt 
tgggtattta 
cgcagccgcc 
gcatctgccc 
gccgccagat 
gaagctgcag 
tttctttgtg 
caccggggtg 
tcagagaatt 
cagaaatggc 
gctccccaaa 
cgcctgtttg 
gcagacgcag 
atcaaaaact 
ctcggagaag 
caactcgcgg 
taaaaccgcc 
gatttataaa 
gggatgggcc 
taccgggaag 
aaactggacc 
ggaggagggg 
caaggtgcgc 
cgtcacctcc 
ccagcagccg 
ctttgggaag 



cgctcgctcg 
ggcggcctca 
ttcctggagg 
agaggtcacg 
agcccgagtg 
atgccggggt 
ggcatttctg 
tctgacatgg 
cgcgactttc 
caatttgaga 
aaatccatgg 
taccgcggga 
gccggaggcg 
acccagcctg 
aatctcacgg 
gagcagaaca 
tcagccaggt 
cagtggatcc 
tcccaaatca 
cccgactacc 
attttggaac 
acgaaaaagt 
accaacatcg 
aatgagaact 
aagatgaccg 
gtggaccaga 
aacaccaaca 
ttgcaagacc 
gtcaccaagc 



ctcactgagg 

gtgagcgagc 

ggtggagtcg 

tgagtgtttt 

agcacgcagg 

tttacgagat 

acagctttgt 

atctgaatct 

tgacggaatg 

agggagagag 

ttttgggacg 

tcgagccgac 

ggaacaaggt 

agctccagtg 

agcgtaaacg 

aagagaatca 

acatggagct 

aggaggacca 

aggctgcctt 

tggtgggcca 

taaacgggta 

tcggcaagag 

cggaggccat 

ttcccttcaa 

ccaaggtcgt 

aatgcaagtc 

tgtgcgccgt 

ggatgttcaa 

aggaagtcaa 



ccgggcgacc 
gagcgcgcag 
tgacgtgaat 
gcgacatttt 
gtctccattt 
tgtgattaag 
gaactgggtg 
gattgagcag 
gcgccgtgtg 
ctacttccac 
tttcctgagt 
tttgccaaac 
ggtggatgag 
ggcgtggact 
gttggtggcg 
gaatcccaat 
ggtcgggtgg 
ggcctcatac 
ggacaatgcg 
gcagcccgtg 
cgatccccaa 
gaacaccatc 
agcccacact 
cgactgtgtc 
ggagtcggcc 
ctcggcccag 
gattgacggg 
atttgaactc 
agactttttc 



aaaggtcgcc 
agagggagtg 
tacgtcatag 
gcgacaccat 
tgaagcggga 
gtccccagcg 
gccgagaagg 
gcacccctga 
agtaaggccc 
atgcacgtgc 
cagattcgcg 
tggttcgcgg 
tgctacatcc 
aatatggaac 
cagcatctga 
tctgatgcgc 
ctcgtggaca 
atctccttca 
ggaaagatta 
gaggacattt 
tatgcggctt 
tggctgtttg 
gtgcccttct 
gacaagatgg 
aaagccattc 
atagacccga 
aactcaacga 
acccgccgtc 
cggtgggcaa 



21 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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aggatcacgt 

gacccgcccc 

agccatcgac 

gttctcgtca 

atcagaattc 

tgtcagaatc 

atcatatcat 

tggatgactg 

cttceagatt 

cctggcccac 

cttcctgggt 

gaggcagacg 

agacaacccg 

agatacgtct 

tgaacctctg 

agagcactct 

tgcaagaaaa 

gcctctcgga 

cagtggcgca 

aaattggcat 

ctgggccctg 

ctcgaacgac 

attccactgc 

ccgacccaag 

tgacggtacg 

ggagtaccag 

agcagacgtc 

agtaggacgc 

aaacaacttt 

cagccagagt 

cagaacaaac 

agcgagtgac 

gcgagtatca 

caagtaccac 

cgatgaagaa 

gaaaacaaat 

caatcccgtg 

acaagcagct 

cagagatgtg 

tcacccctct 

caagaacacc 

cttcatcaca 

aggaaaacag 

ttaatcgtgg 

ccagatacct 

cagttgaact 

agtagcatgg 

cctctctgcg 

gctttgcccg 



ggttgaggtg 

cagtgacgca 

gtcagacgcg 

cgtgggcatg 

aaatatctgc 

tcaacccgtt 

gggaaaggtg 

catctttgaa 

ggctcgagga 

caccaccaaa 

acaagtacct 

ccgcggccct 

tacctcaagt 

tttgggggca 

ggcctggttg 

cctgtggagc 

agattgaatt 

cagccaccag 

ccaatggcag 

tgcgattcca 

cccacctaca 

aatcactact 

cacttttcac 

agactcaact 

acgacgattg 

ctcccgtacg 

ttcatggtgc 

tcttcatttt 

accttcagct 

ctggaccgtc 

actccaagtg 

attcgggacc 

aagacatctg 

ctcaatggca 

aagttttttc 

gtgaacattg 

gctacggagc 

accgcagatg 

taccttcagg 

cccctcatgg 

ccggtacctg 

cagtactcca 

caaacgctgg 

acttaccgtg 

gactcgtaat 

ttggtctctg 

cgggttaatc 

cgctcgctcg 

ggcggcctca 



gagcatgaat 

gatataagtg 

gaagcttcga 

aatctgatgc 

ttcactcacg 

tctgtcgtca 

ccagacgctt 

caataaatga 

cactctctct 

gcccgcagag 

cggacccttc 

cgagcacgta 

acaaccacgc 

acctcggacg 

aggaacctgt 

cagactcctc 

ttggtcagac 

cagccccctc 

acaataacga 

catggatggg 

acaaccacct 

ttggctacag 

cacgtgactg 

tcaagctctt 

ccaataacct 

tcctcggctc 

cacagtatgg 

actgcctgga 

acacttttga 

tcatgaatcc 

gaaccaccac 

agtctaggaa 

cggataacaa 

gagactctct 

ctcagagcgg 

aaaaggtcat 

agtatggttc 

tcaacacaca 

ggcccatctg 

gtggattcgg 

cgaatccttc 

cgggacacgg 

aatcccgaaa 

gatactaatg 

ctgtaattgc 

cgtatttctt 

attaactaca 

ctcactgagg 

gtgagcgagc 



tctacgtcaa 

agcccaaacg 

tcaactacgc 

tgtttccctg 

gacagaaaga 

aaaaggcgta 

gcactgcctg 

tttaaatcag 

gaaggaataa 

cggcataagg 

aacggactcg 

caaagcctac 

cgacgcggag 

agcagtcttc 

taagacggct 

ctcgggaacc 

tggagacgca 

tggtctggga 

gggcgccgac 

cgacagagtc 

ctacaaacaa 

caccccttgg 

gcaaagactc 

taacattcaa 

taccagcacg 

ggcgcatcaa 

atacctcacc 

gtactttcct 

ggacgttcct 

tctcatcgac 

gcagtcaagg 

ctggcttcct 

caacagtgaa 

ggtgaatccg 

ggttctcatc 

gattacagac 

tgtatctacc 

aggcgttctt 

ggcaaagatt 

acttaaacac 

gaccaccttc 

tcagcgtgga 

ttcagtacac 

gcgtgtattc 

ttgttaatca 

tcttatctag 

aggaacccct 

ccgggcgacc 

gagcgcgcag 



aaagggtgga 

ggtgcgcgag 

agacaggtac 

cagacaatgc 

ctgtttagag 

tcagaaactg 

cgatctggtc 

gtatggctgc 

gacagtggtg 

acgacagcag 

acaagggaga 

gaccggcagc 

tttcaggagc 

caggcgaaaa 

ccgggaaaaa 

ggaaaggcgg 

gactcagtac 

actaatacga 

ggagtgggta 

atcaccacca 

atttccagcc 

gggtattttg 

atcaacaaca 

gtcaaagagg 

gttcaggtgt 

ggatgcctcc 

ctgaacaacg 

tctcagatgc 

ttccacagca 

cagtacctgt 

cttcagtttt 

ggaccctgtt 

tactcgtgga 

gccatggcaa 

tttgggaagc 

gaagaggaaa 

aacctccaga 

ccaggcatgg 

ccacacacgg 

cctcctccac 

agtgcggcaa 

gatcgagtgg 

ttccaactac 

agagcctcgc 

ataaaccgtt 

tttccatggc 

agtgatggag 

aaaggtcgcc 

agagggagtg 



gccaagaaaa 

tcagttgcgc 

caaaacaaat 

gagagaatga 

tgctttcccg 

tgctacattc 

aatgtggatt 

cgatggttat 

gaagctcaaa 

gggtcttgtg 

gccggtcaac 

tcgacagcgg 

gccttaaaga 

agagggttct 

agaggccggt 

gccagcagcc 

ctgaccccca 

tggctacagg 

attcctccgg 

gcacccgaac 

aatcaggagc 

acttcaacag 

actggggatt 

tcacgcagaa 

ttactgactc 

cgccgttccc 

ggagtcaggc 

tgcgtaccgg 

gctacgctca 

attacttgag 

ctcaggccgg 

accgccagca 

ctggagctac 

gccacaagga 

aaggctcaga 

tcggaacaac 

gaggcaacag 

tctggcagga 

acggacattt 

agattctcat 

agtttgcttc 

gagctgcaga 

aacaagtctg 

cccattggca 

taattcgttt 

tacgtagata 

ttggccactc 

cgacgcccgg 

gccaa 



<210> 47 
<211> 4694 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

gtggcactcc cccccctgtc gcgttcgctc gttcgctggc tcgattgggg gggtggcagc 
tcaaagagct gccagacgac ggccctctgg gccgtcgccc ccccaatcga gccagcgaac 
gagcgaacgc gacagggggg ggagtgccac actctctagc aagggggttt tgtaggtggt 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4675 



60 
120 
180 
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gatgtcattg 

atgtgtgtta 

taaaaggggt 

cgctgccatg 

cctgcctgga 

tcccgagtca 

aattcgccgc 

tgtgcagttt 

catctcttct 

ggtgttccag 

gggcggagcc 

ccaaccagag 

tctggaggag 

agaggcatct 

gaaatacatg 

gattcaggag 

gattaaagcc 

ctatctcgtg 

tgatctcaac 

cgcctttgga 

catcgcggaa 

gaactttccc 

gaccagcaag 

tcaaaaatgt 

taacatgtgt 

agaccgcatg 

caagcaggaa 

cgagtttatg 

cccactggat 

agttgttcct 

tctcaactgg 

aacgggggaa 

taatgctaca 

agattttaat 

ttttgttgac 

cggccttgag 

aggtcttgtg 

tcctgtcaat 

tgaggcgggc 

actcgcttct 

gaggattctc 

aaagaggcct 

aggcaaacag 

gcctccccca 

agctggcgga 

tgattggcat 

ctgggtcctg 

cgacaccttc 

ccacttctcg 

aagcatgcaa 

gacgaccgta 

gctcccgtac 

gttcatggtg 

gacagacaga 

aaacaacttt 

cagccagagc 

gtctaccacc 

gaccaaaaca 

gcagagattc 

tctgctccat 

aacggccatg 

tgcggggccc 

agaagatgaa 



ttgatgtcat 

tccaatagga 

gagtgaacga 

gctaccttct 

atttctgaca 

gatttgaatt 

gtgttcctgt 

gaaaagggat 

atggtccttg 

aacattgagc 

aataaggtgg 

cttcagtggg 

cgcaaacggc 

tcccagaggg 

gcgctggtaa 

aatcaggaga 

gcgcttgaca 

ggacagactg 

ggctacgacc 

aagaggaaca 

gccatctctc 

tttaatgact 

gtggtggaac 

aaatcctctg 

gtggtggtgg 

ttcagatttg 

gtcaaagact 

gttcccaaga 

gacgtcacca 

gagacgcctc 

tcttccaggt 

tgtgacgagt 

cattgtcaaa 

gattttgatg 

caccctccag 

gcgggtcccc 

cttcctgggt 

tttgctgacg 

gataaccctt 

gacacttctt 

gaacctcttg 

ctagagcaga 

cctgccagaa 

gaaggaccat 

aatggtggcg 

tgcgattcca 

ccgacctaca 

aacggattct 

ccaagagact 

gtccgcatct 

tccaacaacc 

gtgatggatg 

cctcagtacg 

aatgccttct 

gagatggtgt 

ctggataggc 

tctggaggaa 

aacttttctg 

tccaagactg 

tatgagacca 

gcaaccgcag 

aacatcaccg 

cttagggcca 



tatagttgtc 

tgaaagcgcg 

gcccgccgcc 

atgaagtcat 

actttgtaga 

tgadtctgat 

acgagtggaa 

ctgaatattt 

gccgctacgt 

cgcggattaa 

tggattctgg 

cgtggactaa 

tcgtcgctca 

acgtttcggc 

gctggctggt 

gctacctgtc 

acgcgtcaaa 

ttccagagga 

cggcatacgc 

ccgtctggct 

acaccgtgcc 

gtgtggaaaa 

ccgccaaggc 

tacaagtaga 

atgggaactc 

aactcatgcg 

tttttgcttg 

aagtggcggg 

ataccaacta 

gcagttcaga 

atgaatgcag 

gtgaatattt 

tttgtcacgc 

actgtaataa 

attggttgga 

cgaaacccaa 

acaagtatct 

aggttgcccg 

acctcaagta 

ttgggggaaa 

gcctggttga 

gtcctcaaga 

agagactcaa 

cttccggagc 

atgcgggaca 

cttggtcaga 

acaaccacct 

ccaccccctg 

ggcaaaggct 

tcaacatcca 

tcaccagcac 

caggtcagga 

ggtactgcgg 

actgtctgga 

acaagtttga 

tgatgaaccc 

ctctcaacca 

gctaccgcaa 

ccagtcaaaa 

gaactaccct 

ccaacgacgc 

gcaacaccac 

ccaacccccg 



acgcgatagx 

cgaatgagat 

attctctgct 

tgttcgcgtt 

ctgggtaact 

tgagcagcct 

caaattttcc 

tcatctgcac 

gagtcagatt 

cgactgggtc 

gtatattccc 

cctcgaagag 

gtttcagctt 

tgacccggtc 

ggaacatggc 

cttcaactcc 

aattatgagt 

catttctgaa 

gggctctgtt 

gtatggaccc 

cttttatggc 

aatgttgatc 

catcttgggg 

ctctaccccg 

cacgaccttt 

gcggctcccg 

ggcaaaggtc 

aactgagagg 

taaaagtccg 

cgtgcctgta 

atgtgactat 

gaatcggggc 

tgttcctcca 

agagcagtaa 

atcgatcggc 

ggccaatcaa 

tggtcctggg 

agagcacgac 

caaccacgcg 

ccttgggaag 

gacgccggat 

gccagactcc 

ctttgacgac 

tatgtctact 

aggtgccgag 

gagccacgtc 

gtacctgcgg 

gggatacttt 

catcaacaac 

agttaaggag 

ggtccagatc 

gggcagcttg 

actggtaacc 

gtactttccc 

aaacgtgccc 

gctgctggac 

gggcaattca 

aaactggctc 

ctacaagatt 

cgacggaaga 

caccgacttc 

cacagatgcc 

ggacactgac 



taatgartaa 

ctcgcgagac 

ctggactgct 

ccatttgatg 

ggtcaaattt 

cagctgacgg 

aagcaggaga 

acgctcgtgg 

cgcgcccagc 

gccatcacca 

gcctacctgc 

tataaattgg 

gagtcctcgc 

atcaagagca 

atcacttccg 

acgggaaact 

ctgaccaaat 

aacagaatct 

ctctacggct 

gcgaccaccg 

tgtgtgaact 

tggtgggagg 

gggtctagag 

gtgattatca 

gaacaccagc 

ccagattttg 

aaccaggtgc 

gcggagactt 

gagaagcggg 

gagcccgctc 

catgctaaat 

aaaaatggct 

tgggaaaagg 

ataaagtgag 

gacggctttc 

cagaagcaag 

aacggccttg 

ctctcctacc 

gacgcagagt 

gctgttttcc 

aaaacggcgc 

tcgagcggag 

gaacctggag 

gagactgaaa 

ggagtgggta 

accaccacct 

ctcggctcga 

gactttaacc 

cactggggac 

gtcacgacgt 

tttgcggaca 

cctcctttcc 

ggaggcagct 

agccagatgc 

ttccactcca 

cagtacctgt 

gccaccaact 

ccggggccca 

ccccagggaa 

tggagcaatt 

tctcaggccc 

aataacctga 

ctgtttggcc 



cagtcatgtg 

ttccggggta 

agaggaccct 

tggaagagca 

gggagctgcc 

tggctgacag 

gcaaattctt 

agacctccgg 

tggtgaaggt 

aggtaaagaa 

tgccgaaggt 

ccgccctcaa 

agcgctcgca 

agacttccca 

agaagcagtg 

ctcggagcca 

ctgcctcaga 

ggcagatttt 

ggtgcactcg 

gaaagaccaa 

ggactaatga 

agggaaagat 

tacgagtgga 

cctccaatac 

agccgctgga 

gcaagattac 

cggtgactca 

ctagaaaacg 

cccggctctc 

ctctgcgacc 

ttgactctgt 

gtatctttca 

aaaatgtgtc 

tagtcatgtc 

gtgaatttct 

ataacgctcg 

ataagggcga 

agaaacagct 

ttcaggagaa 

aggctaaaaa 

ctgcggcaaa 

ttggcaagaa 

ccggagacgg 

tgcgtgcagc 

atgcctccgg 

caacccgcac 

gcaacgccag 

gcttccactg 

tgcgccccaa 

ctaacgggga 

gcacgtacga 

ccaacgacgt 

ctcaaaacca 

tgagaaccgg 

tgtacgctca 

gggagctcca 

ttgccaagct 

tgatgaagca 

gaaacaacag 

ttgccccggg 

agctcatctt 

tgttcacttc 

acctggcaac 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
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caaccagcaa 
cccgggaatg 
tccacacacg 
cccgcctcca 
ctctccggcc 
aatagaatgg 
gtccaactac 
agagcccagg 
ataccctcaa 
gtacaacagc 
tgtcgcgttc 
cgacggccct 
ggggggagtg 



aacgccacca 
gtgtggcagg 
gatggacact 
caaatattca 
agaatcaaca 
gaaatccaga 
ggagcacagg 
gccattggat 
taaaccgtgt 
ttacaacaac 
gctcgttcgc 
ctgggccgtc 
ccac 



ccgttcctac 
acagagacat 
ttcacccgtc 
tcaaaaacac 
gcttcatcac 
aggagcggtc 
actcgcttct 
cccgatacct 
attcgtttca 
aacaaaaccc 
tggctcgatt 
gcccccccaa 



cgxagacgac 
ttactaccaa 
tcctctcatt 
tcctgtaccc 
ccagtacagc 
caagagatgg 
ctgggctccc 
caccaaccac 
gtaaaatact 
ccttgctaga 

gggggggtgg 

tcgagccagc 



gtggacggag 
gggcccattt 
ggcggatttg 
gccaatcccg 
accggacagg 
aacccagagg 
gacaacgccg 
ctctagccca 
gcctcttgtg 
gagtgtggca 
cagctcaaag 
gaacgagcga 



tcggcgtgta 
gggccaaaat 
gactgaaaag 
caacgacctt 
tggctgtcaa 
tccagttcac 
gagcctacaa 
attctgttgc 
gtcattcggc 
ctcccccccc 
agctgccaga 
acgcgacagg 



<210> 48 
<211> 1833 
<212> DNA 

<213> Artificial Sequence 

< 22 0> ... . - 

<223> Description of Artificial Sequence; note = 

synthetic construct 



<400> 48 

atggctacct 

ggaatttctg 

tcagatttga 

cgcgtgttcc 

tttgaaaagg 

tctatggtcc 

cagaacattg 

gccaataagg 

gagcttcagt 

gagcgcaaac 

tcttcccaga 

atggcgctgg 

gagaatcagg 

gccgcgcttg 

gtgggacaga 

aacggctacg 

ggaaagagga 

gaagccatct 

ccctttaatg 

aaggtggtgg 

tgtaaatcct 

tgtgtggtgg 

atgttcagat 

gaagtcaaag 

atggttccca 

gatgacgtca 

cctgagacgc 

tggtcttcca 

gaatgtgacg 

acacattgtc 

aatgattttg 



tctatgaagt 
acaactttgt 
atttgactct 
tgtacgagtg 
gatctgaata 
ttggccgcta 
agccgcggat 
tggtggattc 
gggcgtggac 
ggctcgtcgc 
gggacgtttc 
taagctggct 
agagctacct 
acaacgcgtc 
ctgttccaga 
acccggcata 
acaccgtctg 
ctcacaccgt 
actgtgtgga 
aacccgccaa 
ctgtacaagt 
tggatgggaa 
ttgaactcat 
acttttttgc 
agaaagtggc 
ccaataccaa 
ctcgcagttc 
ggtatgaatg 
agtgtgaata 
aaatttgtca 
atgactgtaa 



cattgttcgc 
agactgggta 
gattgagcag 
gaacaaattt 
ttttcatctg 
cgtgagtcag 
taacgactgg 
tgggtatatt 
taacctcgaa 
tcagtttcag 
ggctgacccg 
ggtggaacat 
gtccttcaac 
aaaaattatg 
ggacatttct 
cgcgggctct 
gctgtatgga 
gcccttttat 
aaaaatgttg 
ggccatcttg 
agactctacc 
ctccacgacc 
gcggcggctc 
ttgggcaaag 
gggaactgag 
ctataaaagt 
agacgtgcct 
cagatgtgac 
tttgaatcgg 
cgctgttcct 
taaagagcag 



gttccatttg 
actggtcaaa 
cctcagctga 
tccaagcagg 
cacacgctcg 
attcgcgccc 
gtcgccatca 
cccgcctacc 
gagtataaat 
cttgagtcct 
gtcatcaaga 
ggcatcactt 
tccacgggaa 
agtctgacca 
gaaaacagaa 
gttctctacg 
cccgcgacca 
ggctgtgtga 
atctggtggg 
ggggggtcta 
ccggtgatta 
tttgaacacc 
ccgccagatt 
gtcaaccagg 
agggcggaga 
ccggagaagc 
gtagagcccg 
tatcatgcta 
ggcaaaaatg 
ccatgggaaa 
taa 



atgtggaaga 
tttgggagct 
cggtggctga 
agagcaaatt 
tggagacctc 
agctggtgaa 
ccaaggtaaa 
tgctgccgaa 
tggccgccct 
cgcagcgctc 
gcaagacttc 
ccgagaagca 
actctcggag 
aatctgcctc 
tctggcagat 
gctggtgcac 
ccggaaagac 
actggactaa 
aggagggaaa 
gagtacgagt 
tcacctccaa 
agcagccgct 
ttggcaagat 
tgccggtgac 
cttctagaaa 
gggcccggct 
ctcctctgcg 
aatttgactc 
gctgtatctt 
aggaaaatgt 



gcacctgcct 
gcctcccgag 
cagaattcgc 
ctttgtgcag 
cggcatctct 
ggtggtgttc 
gaagggcgga 
ggtccaacca 
caatctggag 
gcaagaggca 
ccagaaatac 
gtggattcag 
ccagattaaa 
agactatctc 
ttttgatctc 
tcgcgccttt 
caacatcgcg 
tgagaacttt 
gatgaccagc 
ggatcaaaaa 
tactaacatg 
ggaagaccgc 
taccaagcag 
tcacgagttt 
acgcccactg 
ctcagttgtt 
acctctcaac 
tgtaacgggg 
tcataatgct 
gtcagatttt 



4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4694 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1833 



<210> 49 
<211> 610 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note 
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synthetic construct 

Met°Ala 9 Thr Phe Tyr Glu val lie val Arg Val Pro Phe Asp Val Glu 

1 5 10 15 

Glu His Leu Pro Gly He Ser Asp Asn Phe val Asp Trp val Thr Gly 

20 25 30 

Gin lie Trp Glu Leu Pro Pro Glu Ser Asp Leu Asn Leu Thr Leu lie 

35 40 45 

Glu Gin Pro Gin Leu Thr Val Ala Asp Arg He Arg Arg Val Phe Leu 

50 55 60 

Tyr Glu Trp Asn Lys Phe Ser Lys Gin Glu ser Lys Phe Phe Val Gin 
65 70 75 80 

Phe Glu Lys Gly Ser Glu Tyr Phe His Leu His Thr Leu val Glu Thr 

85 90 95 

Ser Gly lie ser Ser Met val Leu Gly Arg Tyr val ser Gin He Arg 

100 105 HO 

Ala Gin Leu Val Lys Val val Phe Gin Asn lie Glu Pro Arg He Asn 

115 120 125 

Asp Trp- val Ala He Thr lys val Lys Lys Gly Gly Ala Asn Lys val 

130 135 140 

val Asp Ser Gly Tyr He Pro Ala Tyr Leu Leu Pro Lys val Gin Pro 
145 150 155 160 

Glu Leu Gin Trp Ala Trp Thr Asn Leu Glu Glu Tyr Lys Leu Ala Ala 

165 170 175 

Leu Asn Leu Glu Glu Arg Lys Arg Leu val Ala Gin Phe Gin Leu Glu 

180 185 190 

Ser Ser Gin Arg Ser Gin Glu Ala ser Ser Gin Arg Asp val Ser Ala 

195 ~ 200 205 

Asd pro val lie Lys ser Lys Thr ser Gin Lys Tyr Met Ala Leu Val 

210 215 220 

ser Trp Leu Val Glu His Gly He Thr Ser Glu Lys Gin Trp He Gin 
225 230 235 240 

Glu Asn Gin Glu ser Tyr Leu Ser Phe Asn ser Thr Gly Asn Ser Arg 

245 250 255 

ser Gin He Lys Ala Ala Leu Asp Asn Ala ser Lys lie Met ser Leu 

260 265 270 

Thr Lys Ser Ala Ser Asp Tyr Leu Val Gly Gin Thr Val Pro Glu Asp 

275 280 285 

lie ser Glu Asn Arg He Trp Gin lie Phe Asp Leu Asn Gly Tyr Asp 

290 295 300 

pro Ala Tyr Ala Gly Ser val Leu Tyr Gly Trp Cys Thr Arg Ala Phe 
305 310 315 320 

Gly Lys Arg Asn Thr Val Trp Leu Tyr Gly Pro Ala Thr Thr Gly Lys 

325 330 335 

Thr Asn He Ala Glu Ala He Ser His Thr Val Pro Phe Tyr Gly cys 

340 345 350 

Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp cys Val Glu Lys 

355 360 365 

Met Leu He Trp Trp Glu Glu Gly Lys Met Thr ser Lys Val Val Glu 

370 375 380 

Pro Ala Lys Ala He Leu Gly Gly ser Arg val Arg val Asp Gin Lys 
385 390 395 400 

Cys Lys Ser ser val Gin val Asp ser Thr Pro val He He Thr ser 

405 410 415 

Asn Thr Asn Met cys val Val Val Asp Gly Asn Ser Thr Thr Phe Glu 

420 425 430 

His Gin Gin Pro Leu Glu Asp Arg Met Phe Arg Phe Glu Leu Met Arg 

435 440 445 

Arg Leu Pro Pro Asp Phe Gly Lys He Thr Lys Gin Glu val Lys Asp 

450 455 460 

Phe Phe Ala Trp Ala Lys Val Asn Gin Val Pro val Thr His Glu Phe 
465 470 475 480 
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Met val Pro i_ys Lys val Ala Gly Thr Glu Arg Ala Glu Thr ser Arg 

485 490 495 

Lys Arg Pro Leu Asp Asp val Thr Asn Thr Asn Tyr Lys Ser Pro Glu 

Lys Arg Ala Arg Leu ser val val Pro Glu Thr Pro Arg Ser ser Asp 

515 520 525 

val Pro val Glu Pro Ala Pro Leu Arg Pro Leu Asn Trp Ser Ser Arg 

530 535 " 540 

Tyr Glu Cys Arg Cys Asp Tyr His Ala Lys Phe Asp Ser Val Thr Gly 
545 550 555 560 

Glu Cys Asp Glu Cys Glu Tyr Leu Asn Arg Gly Lys Asn Gly cys lie 

565 570 575 

Phe His Asn Ala Thr His cys Gin lie cys His Ala Val Pro Pro Trp 

580 585 590 

Glu Lys Glu Asn Val Ser Asp Phe Asn Asp Phe Asp Asp Cys Asn Lys 
595 600 605 

Glu Gin 
610 

<210> 50 
<211> U73 
<212> DNA 

<213> Artificial sequence 

<220> . n 

<223> Description of Artificial Sequence; note = 
synthetic construct 

atggcgctgg taagctggct ggtggaacat ggcatcactt ccgagaagca gtggattcag 60 

qagaatcagg agagctacct gtccttcaac tccacgggaa actctcggag ccagattaaa 120 

gccgcgcttg acaacgcgtc aaaaattatg agtctgacca aatctgcctc agactatctc 180 

gtgggacaga ctgttccaga ggacatttct gaaaacagaa tctggcagat ttttgatctc 240 

aacggctacg acccggcata cgcgggctct gttctctacg gctggtgcac tcgcgccttt 300 

ggaaagagga acaccgtctg gctgtatgga cccgcgacca ccggaaagac caacatcgcg 360 

gaagccatct ctcacaccgt gcccttttat ggctgtgtga actggactaa tgagaacttt 420 

ccctttaatg actgtgtgga aaaaatgttg atctggtggg aggagggaaa gatgaccagc 480 

aaggtggtgg aacccgccaa ggccatcttg ggggggtcta gagtacgagt ggatcaaaaa 540 

tgtaaatcct ctgtacaagt agactctacc ccggtgatta tcacctccaa tactaacatg 600 

tgtgtggtgg tggatgggaa ctccacgacc tttgaacacc agcagccgct ggaagaccgc 660 

atgttcagat ttgaactcat gcggcggctc ccgccagatt ttggcaagat taccaagcag 720 

gaagtcaaag acttttttgc ttgggcaaag gtcaaccagg tgccggtgac tcacgagttt 780 

atggttccca agaaagtggc gggaactgag agggcggaga cttctagaaa acgcccactg 840 

gatgacgtca ccaataccaa ctataaaagt ccggagaagc gggcccggct ctcagttgtt 900 

cctgagacgc ctcgcagttc agacgtgcct gtagagcccg ctcctctgcg acctctcaac 960 

tggtcttcca ggtatgaatg cagatgtgac tatcatgcta aatttgactc tgtaacgggg 1020 

gaatgtgacg agtgtgaata tttgaatcgg ggcaaaaatg gctgtatctt tcataatgct 1080 

acacattgtc aaatttgtca cgctgttcct ccatgggaaa aggaaaatgt gtcagatttt 1140 

aatgattttg atgactgtaa taaagagcag taa li/: > 

<210> 51 
<211> 390 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

SetAlspLeu val Ser Trp Leu Val Glu His Gly lie Thr ser Glu Lys 

1 5 10 15 

Gin Trp lie Gin Glu Asn Gin Glu Ser Tyr Leu Ser Phe Asn ser Thr 
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20 25 30 

Gly Asn ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Ser Lys 

35 40 45 

lie Met Ser Leu Thr Lys ser Ala ser Asp Tyr Leu val Gly Gin Thr 

50 55 60 

val Pro Glu Asp lie ser Glu Asn Arg lie Trp Gin lie Phe Asp Leu 
65 70 75 80 

Asn Gly Tyr Asp Pro Ala Tyr Ala Gly ser val Leu Tyr Gly Trp cys 

85 90 95 

Thr Arg Ala Phe Gly Lys Arg Asn Thr Val Trp Leu Tyr Gly Pro Ala 

100 " 105 . 110 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie ser His Thr Val pro 

115 120 125 

Phe Tyr Gly cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

Cvs val Glu Lys Met Leu lie Trp Trp Glu Glu Gly Lys Met Thr ser 
145 150 155 160 

Lys val val Glu Pro Ala Lys Ala He Leu Gly Gly ser Arg val Arg m 

165 170 175 

Val Asp Gin Lys Cys Lys Ser Ser Val Gin Val Asp Ser Thr Pro Val 

180 185 190 

lie lie Thr Ser Asn Thr Asn Met Cys val Val Val Asp Gly Asn Ser 

195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Glu Asp Arg Met Phe Arg Phe 

210 215 220 

Glu Leu Met Arg Arg Leu Pro Pro Asp Phe Gly Lys He Thr Lys Gin 
225 230 235 240 

Glu val Lys Asp Phe Phe Ala Trp Ala Lys val Asn Gin val Pro val 

245 250 255 

Thr His Glu Phe Met val Pro Lys Lys val Ala Gly Thr Glu Arg Ala 

260 265 270 

Glu Thr Ser Arg Lys Arg Pro Leu Asp Asp val Thr Asn Thr Asn Tyr 

275 280 285 

Lvs Ser Pro Glu Lys Arg Ala Arg Leu Ser Val val Pro Glu Thr Pro 

290 295 300 

Arq Ser Ser Asp Val pro Val Glu Pro Ala Pro Leu Arg Pro Leu Asn 
305 310 315 . 320 

Trp Ser Ser Arg Tyr Glu Cys Arg cys Asp Tyr His Ala Lys Phe Asp 

325 330 335 

Ser val Thr Gly Glu Cys Asp Glu Cys Glu Tyr Leu Asn Arg Gly Lys 

340 345 350 

Asn Gly Cys lie Phe His Asn Ala Thr His cys Gin lie cys His Ala 

355 360 365 

val Pro Pro Trp Glu Lys Glu Asn val Ser Asp Phe Asn Asp Phe Asp 

370 375 380 

Asp Cys Asn Lys Glu Gin 
385 390 

<210> 52 
<211> 2211 
<212> DNA 

<213> Artificial Sequence 



<220> . , 

<223> Description of Artificial Sequence; note = 
synthetic construct 



60 



<400> 52 

atgtcttttg ttgaccaccc tccagattgg ttggaatcga tcggcgacgg ctttcgtgaa 

tttctcggcc ttgaggcggg tcccccgaaa cccaaggcca atcaacagaa gcaagataac izu 

gctcgaggtc ttgtgcttcc tgggtacaag tatcttggtc ctgggaacgg ccttgataag 180 

ggcgatcctg tcaattttgc tgacgaggtt gcccgagagc acgacctctc ctaccagaaa 240 

cagcttgagg cgggcgataa cccttacctc aagtacaacc acgcggacgc agagtttcag 3UU 
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gagaaactcg 

aaaaagagga 

gcaaaaaaga 

aagaaaggca 

gacgggcctc 

gcagcagctg 

tccggtgatt 

cgcacctggg 

gccagcgaca 

cactgccact 

cccaaaagca 

ggggagacga 

tacgagctcc 

gacgtgttca 

aaccagacag 

accggaaaca 

gctcacagcc 

ctccagtcta 

aagctgacca 

aagcagcaga 

aacagtctgc 

ccgggaacgg 

atctttgcgg 

acttcagaag 

gcaaccaacc 

gtgtacccgg 

aaaattccac 

aaaagcccgc 

accttctctc 

gtcaaaatag 

ttcacgtcca 

tacaaagagc 



cttctgacac 
ttctcgaacc 
ggcctctaga 
aacagcctgc 
ccccagaagg 
gcggaaatgg 
ggcattgcga 
tcctgccgac 
ccttcaacgg 
tctcgccaag 
tgcaagtccg 
ccgtatccaa 
cgtacgtgat 
tggtgcctca 
acagaaatgc 
actttgagat 
agagcctgga 
ccacctctgg 
aaacaaactt 
gattctccaa 
tccattatga 
ccatggcaac 
ggcccaacat 
atgaacttag 
agcaaaacgc 
gaatggtgtg 
acacggatgg 
ctccacaaat 
cggccagaat 
aatgggaaat 
actacggagc 
ccagggccat 



ttcttttggg 
tcttggcctg 
gcagagtcct 
cagaaagaga 
accatcttcc 
tggcgatgcg 
ttccacttgg 
ctacaacaac 
attctccacc 
agactggcaa 
catcttcaac 
caacctcacc 
ggatgcaggt 
gtacgggtac 
cttctactgt 
ggtgtacaag 
taggctgatg 
aggaactctc 
ttctggctac 
gactgccagt 
gaccagaact 
cgcagccaac 
caccggcaac 
ggccaccaac 
caccaccgtt 
gcaggacaga 
acactttcac 
attcatcaaa 
caacagcttc 
ccagaaggag 
acaggactcg 
tggatcccga 



ggaaaccttg 
gttgagacgc 
caagagccag 
ctcaactttg 
ggagctatgt 
ggacaaggtg 
tcagagagcc 
cacctgtacc 
ccctggggat 
aggctcatca 
atccaagtta 
agcacggtcc 
caggagggca 
tgcggactgg 
ctggagtact 
tttgaaaacg 
aacccgctgc 
aaccagggca 
cgcaaaaact 
caaaactaca 
accctcgacg 
gacgccaccg 
accaccacag 
ccccgggaca 
cctaccgtag 
gacatttact 
ccgtctcctc 
aacactcctg 
atcacccagt 
cggtccaaga 
cttctctggg 
tacctcacca 



ggaaggctgt 
cggataaaac 
actcctcgag 
acgacgaacc 
ctactgagac 
ccgagggagt 
acgtcaccac 
tgcggctcgg 
actttgactt 
acaaccactg 
aggaggtcac 
agatctttgc 
gcttgcctcc 
taaccggagg 
ttcccagcca 
tgcccttcca 
tggaccagta 
attcagccac 
ggctcccggg 
agattcccca 
gaagatggag 
acttctctca 
atgccaataa 
ctgacctgtt 
acgacgtgga 
accaagggcc 
tcattggcgg 
tacccgccaa 
acagcaccgg 
gatggaaccc 
ctcccgacaa 
accacctcta 



tttccaggct 
ggcgcctgcg 
cggagttggc 
tggagccgga 
tgaaatgcgt 
gggtaatgcc 
cacctcaacc 
ctcgagcaac 
taaccgcttc 
gggactgcgc 
gacgtctaac 
ggacagcacg 
tttccccaac 
cagctctcaa 
gatgctgaga 
ctccatgtac 
cctgtgggag 
caactttgcc 
gcccatgatg 
gggaagaaac 
caattttgcc 
ggcccagctc 
cctgatgttc 
tggccacctg 
cggagtcggc 
catttgggcc 
atttggactg 
tcccgcaacg 
acaggtggct 
agaggtccag 
cgccggagcc 

g 



<210> 53 
<211> 736 
<212> PRT 

<213> Artificial Sequence 

<220> . 
<223> Description of Artificial 
synthetic construct 

<400> 53 

Met ser Phe val Asp His pro Pro 

1 5 
Gly Phe Arg Glu Phe Leu Giy Leu 
20 

Ala Asn Gin Gin Lys Gin Asp Asn 

35 40 
Tyr Lys Tyr Leu Gly Pro Gly Asn 

Asn Phe Ala Asp Glu Val Ala Arg 

65 ^ 70 

Gin Leu Glu Ala Gly Asp Asn Pro 

Ala Glu Phe Gin Glu Lys Leu Ala 
100 

Leu Gly Lys Ala val Phe Gin Ala 
115 120 
Gly Leu Val Glu Thr Pro Asp Lys 

130 135 
Pro Leu Glu Gin Ser Pro Gin Glu 
145 150 



Sequence; note = 



Asp Trp 

10 
Glu Ala 
25 

Ala Arg 

Gly Leu 

Glu His 

Tyr Leu 
90 
ser Asp 
105 

Lys Lys 
Thr Ala 
Pro Asp 



Leu Glu ser lie 
Gly Pro 
Gly Leu 



Asp Lys 
60 

Asp Leu 
75 

Lys Tyr 

Thr Ser 

Arg He 

pro Ala 
140 
ser Ser 
155 



pro Lys 
30 

val Leu 
45 
Gly Asp 



ser Tyr 
Asn His 



Phe Gly 
110 
Leu Glu 
125 

Ala Lys 



Gly Asp 
15 

Pro Lys 

Pro Gly 

Pro val 

Gin Lys 
80 
Ala Asp 
95 

Gly Asn 



pro Leu 
Lys Arg 
Ser Gly val Gl> 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2211 



Gly 
160 
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Lys Lys Gly 

pro Gly Ala 

Met ser Thr 
195 

Asp Ala Gly 

210 
His cys Asp 
225 

Arg Thr Trp 

Gly ser Ser 

Gly Tyr Phe 
275 

Trp Gin Arg 

290 
Gin Val Arg 
305 

Gly Glu Thr 



Lys Gin 
165 
Gly Asp 
180 

Glu Thr 

Gin Gly 

Ser Thr 

val Leu 
245 
Asn Ala 
260 

Asp Phe 
Leu lie 
lie Phe 



Thr val 
325 

Ala Asp ser Thr Tyr 
340 

Pro Pro 



Gly ser Leu 
355 

Gly Tyr cys 

370 
Arg Asn Ala 
385 

Thr Gly Asn 

His Ser Met 

Leu Leu Asp 
435 

Thr Leu Asn 

450 
Thr Asn Phe 
465 

Lys Gin Gin 

Gin Gly Arg 

Asp Gly Arg 
515 

Ala Asn Asp 

530 
pro Asn lie 
545 

Thr Ser Glu 



Gly Leu 

phe Tyr 

Asn Phe 
405 
Tyr Ala 
420 

Gin Tyr 

Gin Gly 

ser Gly 

Arg Phe 
485 
Asn Asn 
500 

Trp Ser 
Ala Thr 
Thr Gly 



Pro Ala Arg 

Gly Pro Pro 

Glu Met Arg 
200 

Ala Glu Gly 

215 
Trp Ser Glu 
230 

pro Thr Tyr 

Ser Asp Thr 

Asn Arg Phe 
280 

Asn Asn His 

295 
Asn He Gin 
310 

Ser Asn Asn 

Glu Leu Pro 

Phe Pro Asn 
360 

Val Thr Gly 

375 
Cys Leu Glu 
390 

Glu Met val 

His ser Gin 

Leu Trp Glu 
440 

Asn ser Ala 

455 
Tyr Arg Lys 
470 

Ser Lys Thr 

Ser Leu Leu 

Asn Phe Ala 
520 

Asp Phe Ser 

535 
Asn Thr Thr 
550 

Leu Arg Ala 



Lys Arg 
170 
pro Glu 
185 
Ala Ala 

val Gly 

ser His 

Asn Asn 
250 
Phe Asn 
265 

His Cys 

Trp Gly 

val Lys 

Leu Thr 
330 
Tyr val 
345 

Asp val 

Gly ser 

Tyr Phe 

Tyr Lys 
410 
Ser Leu 
425 

Leu Gin 
Thr Asn 
Asn Trp 



Leu Asn Phe 
Gly Pro ser 



Asp Asp Glu 
175 

ser Gly Ala 
190 

Asn Gly Gly 



Ala Gly Gly 
205 

Asn Ala ser Gly Asp Trp 

220 
val Thr Thr 
235 

His Leu Tyr 



Gly Phe ser 

His Phe ser 
285 

Leu Arg Pro 

300 
Glu val Thr 
315 

Ser Thr val 

Met Asp Ala 

Phe Met val 
365 

ser Gin Asn 

380 
Pro ser Gin 
395 

Phe Glu Asn 

Asp Arg Leu 

Ser Thr Thr 
445 

phe Ala Lys 

460 
Leu Pro Gly 
475 

Gin Asn Tyr 



Ala ser 
490 

His Tyr Glu Thr Arg 
505 

Pro Gly 



Asp Glu 
565 

Phe Gly His Leu Ala Thr Asn Gin 
580 

val Asp 



val Asp Asp 
595 
Asp Arg Asp 

610 
Thr Asp Gly 
625 

Lys Ser Pro 
Asn Pro Ala 



lie Tyr 

His Phe 

pro Pro 
645 
Thr Thr 



Gly val Gly 
600 
Tyr Gin Gly 

615 
His Pro ser 
630 

Gin He Phe 
Phe Ser Pro 



Gin Ala 

Thr Asp 

Thr Asn 
570 
Gin Asn 
585 

val Tyr 

pro lie 

Pro Leu 

lie Lys 
650 
Ala Arg 



Thr Ala Met 
525 

Gin Leu He 

540 
Ala Asn Asn 
555 

Pro Arg Asp 

Ala Thr Thr 

pro Gly Met 
605 

Trp Ala Lys 

620 
lie Gly Gly 
635 

Asn Thr Pro 
lie Asn Ser 



Thr Ser Thr 
240 

Leu Arg Leu 
255 

Thr Pro Trp 
270 

Pro Arg Asp 

Lys ser Met 

Thr ser Asn 
320 

Gin lie Phe 
335 

Gly Gin Glu 
350 

Pro Gin Tyr 

Gin Thr Asp 

Met Leu Arg 
400 

val Pro Phe 

415 
Met Asn Pro 
430 

ser Gly Gly 

Leu Thr Lys 

pro Met Met 
480 

Lys lie Pro 

495 
Thr Thr Leu 
510 

Ala Thr Ala 

Phe Ala Gly 

Leu Met Phe 
560 

Thr Asp Leu 
575 

val Pro Thr 
590 

Val Trp Gin 

lie Pro His 

Phe Gly Leu 
640 

val Pro Ala 

655 
Phe lie Thr 
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660 665 670 

Gin Tyr Ser Thr Gly Gin Val Ma val Lys lie Glu Trp Glu He Gin 

675 * 680 685 

Lys Glu Arg ser Lys Arg Trp Asn Pro Glu val Gin Phe Thr Ser Asn 

690 ' 695 700 

Tvr Gly Ala Gin Asp ser Leu Leu Trp Ala Pro Asp Asn Ala Gly Ala 
705 710 715 720 

Tyr Lys Glu Pro Arg Ala lie Gly Ser Arg Tyr Leu Thr Asn His Leu 



<210> 54 
<211> 1803 
<212> DNA 

<213> Artificial Sequence 

< 220> , • n 

<223> Description of Artificial Sequence; note = 

synthetic construct 



<400> 54 

acggcgcctg 

agcggagttg 

cctggagccg 

actgaaatgc 

gtgggtaatg 

accacctcaa 

ggctcgagca 

tttaaccgct 

tggggactgc 

acgacgtcta 

gcggacagca 

cctttcccca 

ggcagctctc 

cagatgctga 

cactccatgt 

tacctgtggg 

accaactttg 

gggcccatga 

cagggaagaa 

agcaattttg 

caggcccagc 

aacctgatgt 

tttggccacc 

gacggagtcg 

cccatttggg 

ggatttggac 

aatcccgcaa 

ggacaggtgg 

ccagaggtcc 

aacgccggag 

tag 



cggcaaaaaa 
gcaagaaagg 
gagacgggcc 
gtgcagcagc 
cctccggtga 
cccgcacctg 
acgccagcga 
tccactgcca 
gccccaaaag 
acggggagac 
cgtacgagct 
acgacgtgtt 
aaaaccagac 
gaaccggaaa 
acgctcacag 
agctccagtc 
ccaagctgac 
tgaagcagca 
acaacagtct 
ccccgggaac 
tcatctttgc 
tcacttcaga 
tggcaaccaa 
gcgtgtaccc 
ccaaaattcc 
tgaaaagccc 
cgaccttctc 
ctgtcaaaat 
agttcacgtc 
cctacaaaga 



gaggcctcta 
caaacagcct 
tcccccagaa 
tggcggaaat 
ttggcattgc 
ggtcctgccg 
caccttcaac 
cttctcgcca 
catgcaagtc 
gaccgtatcc 
cccgtacgtg 
catggtgcct 
agacagaaat 
caactttgag 
ccagagcctg 
taccacctct 
caaaacaaac 
gagattctcc 
gctccattat 
ggccatggca 
ggggcccaac 
agatgaactt 
ccagcaaaac 
gggaatggtg 
acacacggat 
gcctccacaa 
tccggccaga 
agaatgggaa 
caactacgga 
gcccagggcc 



gagcagagtc 
gccagaaaga 
ggaccatctt 
ggtggcgatg 
gattccactt 
acctacaaca 
ggattctcca 
agagactggc 
cgcatcttca 
aacaacctca 
atggatgcag 
cagtacgggt 
gccttctact 
atggtgtaca 
gataggctga 
ggaggaactc 
ttttctggct 
aagactgcca 
gagaccagaa 
accgcagcca 
atcaccggca 
agggccacca 
gccaccaccg 
tggcaggaca 
ggacactttc 
atattcatca 
atcaacagct 
atccagaagg 
gcacaggact 
attggatccc 



ctcaagagcc 
gactcaactt 
ccggagctat 
cgggacaagg 
ggtcagagag 
accacctgta 
ccccctgggg 
aaaggctcat 
acatccaagt 
ccagcacggt 
gtcaggaggg 
actgcggact 
gtctggagta 
agtttgaaaa 
tgaacccgct 
tcaaccaggg 
accgcaaaaa 
gtcaaaacta 
ctaccctcga 
acgacgccac 
acaccaccac 
acccccggga 
ttcctaccgt 
gagacattta 
acccgtctcc 
aaaacactcc 
tcatcaccca 
agcggtccaa 
cgcttctctg 
gatacctcac 



agactcctcg 
tgacgacgaa 
gtctactgag 
tgccgaggga 
ccacgtcacc 
cctgcggctc 
atactttgac 
caacaaccac 
taaggaggtc 
ccagatcttt 
cagcttgcct 
ggtaaccgga 
ctttcccagc 
cgtgcccttc 
gctggaccag 
caattcagcc 
ctggctcccg 
caagattccc 
cggaagatgg 
cgacttctct 
agatgccaat 
cactgacctg 
agacgacgtg 
ctaccaaggg 
tctcattggc 
tgtacccgcc 
gtacagcacc 
gagatggaac 
ggctcccgac 
caaccacctc 



<210> 55 
<2U> 600 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; 
synthetic construct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1803 



note 



Thr°Ala 5 pro Ala Ala Lys Lys Arg Pro Leu Glu Gin ser Pro Gin Glu 
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15 10 15 

pro Asp Ser Ser Ser Gly Val Gly Lys Lys Gly Lys Gin Pro Ala Arg 

20 25 30 

Lys Arg Leu Asn Phe Asp Asp Glu Pro Gly Ala Gly Asp Gly Pro Pro 

35 40 45 

pro Glu Gly Pro ser ser Gly Ala Met ser Thr Glu Thr Glu Met Arg 

50 55 60 

Ala Ala Ala Gly Gly Asn Gly Gly Asp Ala Gly Gin Gly Ala Glu Gly 
65 70 75 80 

val Gly Asn Ala Ser Gly Asp Trp His cys Asp ser Thr Trp ser Glu 

85 90 95 

Ser His val Thr Thr Thr Ser Thr Arg Thr Trp val Leu Pro Thr Tyr 

100 105 HO 

Asn Asn His Leu Tyr Leu Arg Leu Gly Ser ser Asn Ala Ser Asp Thr 

115 120 125 

Phe Asn Gly Phe Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

130 135 140 

His cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn His 
145 150 155 loO 

Tro Gly Leu Arg Pro Lys Ser Met Gin Val Arg lie Phe Asn lie Gin 

165 170 175 

Val Lys Glu Val Thr Thr Ser Asn Gly Glu Thr Thr val Ser Asn Asn 

180 185 190 

Leu Thr Ser Thr Val Gin lie Phe Ala Asp Ser Thr Tyr Glu Leu Pro 

195 200 205 

Tyr Val Met Asp Ala Gly Gin Glu Gly Ser Leu Pro Pro Phe Pro Asn 

210 215 220 

Asp val Phe Met val Pro Gin Tyr Gly Tyr cys Gly Leu Val Thr Gly 
225 230 235 240 

Gly ser ser Gin Asn Gin Thr Asp Arg Asn Ala Phe Tyr cys Leu Glu 

245 250 255 

Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Met Val 

260 265 n 270 

Tyr Lys Phe Glu Asn Val Pro Phe His ser Met Tyr Ala His Ser Gin 

275 280 „ 285 

ser Leu Asp Arg Leu Met Asn Pro Leu Leu Asp Gin Tyr Leu Trp Glu 

290 295 300 

Leu Gin ser Thr Thr Ser Gly Gly Thr Leu Asn Gin Gly Asn Ser Ala 
305 310 315 320 

Thr Asn Phe Ala Lys Leu Thr Lys Thr Asn Phe Ser Gly Tyr Arg Lys 

325 330 335 

Asn Trp Leu Pro Gly Pro Met Met Lys Gin Gin Arg Phe Ser Lys Thr 

340 345 350 

Ala ser Gin Asn Tyr Lys He Pro Gin Gly Arg Asn Asn ser Leu Leu 

355 360 365 

His Tyr Glu Thr Arg Thr Thr Leu Asp Gly Arg Trp ser Asn Phe Ala 

3/0 375 380 

Pro Gly Thr Ala Met Ala Thr Ala Ala Asn Asp Ala Thr Asp Phe ser 
385 390 395 400 

Gin Ala Gin Leu lie Phe Ala Gly Pro Asn lie Thr Gly Asn Thr Thr 

405 410 415 

Thr Asp Ala Asn Asn Leu Met Phe Thr Ser Glu Asp Glu Leu Arg Ala 

420 425 430 

Thr Asn Pro Arg Asp Thr Asp Leu Phe Gly His Leu Ala Thr Asn Gin 

435 440 445 

Gin Asn Ala Thr Thr Val Pro Thr val Asp Asp val Asp Gly val Gly 

450 455 460 

Val Tyr Pro Gly Met val Trp Gin Asp Arg Asp He Tyr Tyr Gin Gly 
465 470 475 480 

pro He Trp Ala Lys He Pro His Thr Asp Gly His Phe His Pro ser 

485 490 _ 495 

pro Leu He Gly Gly Phe Gly Leu Lys Ser Pro pro Pro Gin lie Phe 
500 505 510 
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lie Lys Asn Thr Pro val Pro Ala Asn Pro Ala Thr Thr Phe Ser Pro 

515 520 525 

Ala Arg He Asn Ser Phe lie Thr Gin Tyr ser Thr Gly Gin val Ala 

530 535 540 

Val Lys He Glu Trp Glu He Gin Lys Glu Arg Ser Lys Arg Trp Asn 
545 550 555 560 

Pro Glu val Gin Phe Thr ser Asn Tyr Gly Ala Gin Asp ser Leu Leu 

565 570 575 

Trp Ala Pro Asp Asn Ala Gly Ala Tyr Lys Glu Pro Arg Ala lie Gly 

580 585 590 

Ser Arg Tyr Leu Thr Asn His Leu 
595 600 

<210> 56 
<211> 1617 
<212> DNA 

<213> Artificial Sequence 

<220> -, 
<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 56 

atgcgtgcag 

aatgcctccg 

tcaacccgca 

agcaacgcca 

cgcttccact 

ctgcgcccca 

tctaacgggg 

agcacgtacg 

cccaacgacg 

tctcaaaacc 

ctgagaaccg 

atgtacgctc 

tgggagctcc 

tttgccaagc 

atgatgaagc 

agaaacaaca 

tttgccccgg 

cagctcatct 

atgttcactt 

cacctggcaa 

gtcggcgtgt 

tgggccaaaa 

ggactgaaaa 

gcaacgacct 

gtggctgtca 

gtccagttca 

ggagcctaca 



cagctggcgg 
gtgattggca 
cctgggtcct 
gcgacacctt 
gccacttctc 
aaagcatgca 
agacgaccgt 
agctcccgta 
tgttcatggt 
agacagacag 
gaaacaactt 
acagccagag 
agtctaccac 
tgaccaaaac 
agcagagatt 
gtctgctcca 
gaacggccat 
ttgcggggcc 
cagaagatga 
ccaaccagca 
acccgggaat 
ttccacacac 
gcccgcctcc 
tctctccggc 
aaatagaatg 
cgtccaacta 
aagagcccag 



aaatggtggc 
ttgcgattcc 
gccgacctac 
caacggattc 
gccaagagac 
agtccgcatc 
atccaacaac 
cgtgatggat 
gcctcagtac 
aaatgccttc 
tgagatggtg 
cctggatagg 
ctctggagga 
aaacttttct 
ctccaagact 
ttatgagacc 
ggcaaccgca 
caacatcacc 
acttagggcc 
aaacgccacc 
ggtgtggcag 
ggatggacac 
acaaatattc 
cagaatcaac 
ggaaatccag 
cggagcacag 
ggccattgga 



gatgcgggac 
acttggtcag 
aacaaccacc 
tccaccccct 
tggcaaaggc 
ttcaacatcc 
ctcaccagca 
gcaggtcagg 
gggtactgcg 
tactgtctgg 
tacaagtttg 
ctgatgaacc 
actctcaacc 
ggctaccgca 
gccagtcaaa 
agaactaccc 
gccaacgacg 
ggcaacacca 
accaaccccc 
accgttccta 
gacagagaca 
tttcacccgt 
atcaaaaaca 
agcttcatca 
aaggagcggt 
gactcgcttc 
tcccgatacc 



aaggtgccga 
agagccacgt 
tgtacctgcg 
ggggatactt 
tcatcaacaa 
aagttaagga 
cggtccagat 
agggcagctt 
gactggtaac 
agtactttcc 
aaaacgtgcc 
cgctgctgga 
agggcaattc 
aaaactggct 
actacaagat 
tcgacggaag 
ccaccgactt 
ccacagatgc 
gggacactga 
ccgtagacga 
tttactacca 
ctcctctcat 
ctcctgtacc 
cccagtacag 
ccaagagatg 
tctgggctcc 
tcaccaacca 



gggagtgggt 
caccaccacc 
gctcggctcg 
tgactttaac 
ccactgggga 
ggtcacgacg 
ctttgcggac 
gcctcctttc 
cggaggcagc 
cagccagatg 
cttccactcc 
ccagtacctg 
agccaccaac 
cccggggccc 
tccccaggga 
atggagcaat 
ctctcaggcc 
caataacctg 
cctgtttggc 
cgtggacgga 
agggcccatt 
tggcggattt 
cgccaatccc 
caccggacag 
gaacccagag 
cgacaacgcc 
cctctag 



<210> 57 
<211> 538 
<212> PRT 

<213> Artificial sequence 

<220> ..... 
<223> Description of Artificial sequence; note = 
synthetic construct 

Met°Arg 7 Ala Ala Ala Gly Gly Asn Gly Gly Asp Ala Gly Gin Gly Ala 
15 10 15 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1617 
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Glu Gly val 

ser Glu Ser 
35 

Thr Tyr Asn 

Asp Thr Phe 
65 

Arg Phe His 



Gly Asn Ala 
20 

His Val Thr 
Asn His Leu 



Asn Gly Phe 
70 

cys His Phe 
85 

Asn His Trp Gly Leu Arg 
100 

Lys Glu Val 



He Gin val 
115 
Asn Asn Leu 

130 
Leu Pro Tyr 
145 

Pro Asn Asp 

Thr Gly Gly 

Leu Glu Tyr 
195 

Met Val Tyr 

210 
ser Gin ser 
225 

Trp Glu Leu 

ser Ala Thr 

Arg Lys Asn 
275 

Lys Thr Ala 

290 
Leu Leu His 
305 

Phe Ala Pro 

phe Ser Gin 

Thr Thr Thr 
355 

Arg Ala Thr 
370 

Asn Gin Gin 
385 

val Gly val 



Thr ser Thr 

val Met Asp 
150 

val Phe Met 

165 
Ser Ser Gin 
180 

Phe Pro ser 



ser Gly 

Thr Thr 
40 
Tyr Leu 
55 

Ser Thr 

ser Pro 

Pro Lys 

Thr Thr 
120 
Val Gin 
135 

Ala Gly 
Val Pro 
Asn Gin 



Asp Trp 
25 

Ser Thr 
Arg Leu 
pro Trp 



His cys Asp 

Arg Thr Trp 
45 

Gly ser ser 
60 

Gly Tyr Phe 
75 

Trp Gin Arg 



Arg Asp 
90 

ser Met Gin val Arg 
105 

ser Asn 



Ser Thr Trp 
30 

Val Leu Pro 

Asn Ala Ser 

Asp Phe Asn 
80 

Leu lie Asn 
95 

lie Phe Asn 
110 

Thr val Ser 



Gly Glu Thr 
125 

lie Phe Ala Asp Ser Thr Tyr Glu 
140 

Gin Glu Gly ser Leu 
155 

Gly Tyr Cys 



Gin Met 
200 

Lys Phe Glu Asn Val 
215 

Leu Met 



Gin Tyr 
170 

Thr Asp Arg Asn Ala 
185 

Leu Arg 



pro Pro Phe 
160 

Gly Leu val 
175 

Phe Tyr Cys 
190 

Asn Phe Glu 



Leu Asp Arg 
230 

Gin ser Thr 

245 
Asn Phe Ala 
260 

Trp Leu Pro 

ser Gin Asn 

Tyr Glu Thr 
310 

Gly Thr Ala 
325 

Ala Gin Leu 
340 

Asp Ala Asn 
Asn Pro Arg 



Asn Ala Thr 
390 

Tyr Pro Gly 
405 

Gin Gly Pro lie Trp Ala 
420 

Leu lie Gly 



Thr ser 

Lys Leu 

Gly Pro 
280 
Tyr Lys 
295 

Arg Thr 

Met Ala 

He Phe 

Asn Leu 
360 
Asp Thr 
375 

Thr Val 
Met val 
Lys He 



Pro Phe 

Asn Pro 

Gly Gly 
250 
Thr Lys 
265 

Met Met 

lie Pro 

Thr Leu 

Thr Ala 
330 
Ala Gly 
345 

Met Phe 
Asp Leu 
pro Thr 



Thr Gly Asn 
205 

His Ser Met Tyr Ala His 

220 
Leu Leu Asp 
235 

Thr Leu Asn 



Thr Asn Phe 

Lys Gin Gin 
285 

Gin Gly Arg 

300 
Asp Gly Arg 
315 

Ala Asn Asp 

Pro Asn lie 

Thr Ser Glu 
365 

Phe Gly His 

380 
Val Asp Asp 
395 

Asp Arg Asp 



pro ser Pro 
435 
lie Phe lie 

450 
Ser Pro Ala 
465 

Val Ala Val 
Trp Asn Pro 
Leu Leu Trp 



Gly Phe 
440 

Lys Asn Thr Pro Val 
455 

ser Phe 



Trp Gin 
410 

pro His Thr Asp Gly 
425 

Gly Leu 



Arg lie Asn 
470 

Lys He Glu 

485 
Glu Val Gin 
500 

Ala Pro Asp 



Trp Glu 
phe Thr 
Asn Ala 



pro Ala 

lie Thr 

lie Gin 
490 
ser Asn 
505 

Gly Ala 



Lys Ser Pro 
445 

Asn Pro Ala 

460 
Gin Tyr ser 
475 

Lys Glu Arg 
Tyr Gly Ala 
Tyr Lys Glu 



Gin Tyr Leu 
240 

Gin Gly Asn 
255 

Ser Gly Tyr 
270 

Arg Phe Ser 

Asn Asn Ser 

Trp ser Asn 
320 

Ala Thr Asp 
335 

Thr Gly Asn 
350 

Asp Glu Leu 

Leu Ala Thr 

Val Asp Gly 
400 

lie Tyr Tyr 

415 
His Phe His 
430 

pro Pro Gin 

Thr Thr Phe 

Thr Gly Gin 
480 

ser Lys Arg 

495 
Gin Asp ser 
510 

Pro Arg Ala 
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515 520 525 

lie Gly ser Arg Tyr Leu Thr Asn His Leu 
530 ~ 535 

<210> 58 
<211> 150 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence; note = 
synthetic construct 

qtggcactcc cccccctgtc gcgttcgctc gttcgctggc tcgattgggg gOTtggcagc 
tclLgagct gccagacgac ggccctctgg gccgtcgccc ccccaatcga gccagcgaac 
gagcgaacgc gacagggggg ggagtgccac 

<210> 59 
<211> 20 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 59 

ctctagcaag ggggttttgt 

<210> 60 
<211> 7 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 60 
agtgtgg 

<210> 61 
<211> 158 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence; note = 
synthetic construct 

aqqtqqtgat gtcattgttg atgtcattat agttgtcacg cgatagttaa tgattaacag 
tcatgtgatg tgtgttatcc aataggatga aagcgcgcga atgagatctc gcgagacttc 
cggggtataa aaggggtgag tgaacgagcc cgccgcca 

<210> 62 
<211> 112 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial sequence; note = 
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synthetic construct 



qqtqqattct gggtatattc ccgcctacct gctgccgaag gtccaaccag agcttcagtg 
ggcgtggact aacctcgaag agtataaatt ggccgccctc aatctggagg ag 

<210> 63 

<211> 169 

<212> DNA t 

<213> Artificial sequence 

<220> ,_ , r . , -j + a 

<223> Description of Artificial Sequence; note = 
synthetic construct 

ag?2aaagac ttttttgctt gggcaaaggt caaccaggtg ccggtgactc acgagtttat 
ggttcccaag aaagtggcgg gaactgagag ggcggagact tctagaaaac gcccactgga 
tgacgtcacc aataccaact ataaaagtcc ggagaagcgg gcccggctc 

<210> 64 
<211> 4721 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence; note = 
synthetic construct 



60 
112 



<400> 64 

ttggccactc 

agacggcaga 

gccaactcca 

gtaaatcacg 

ttttgcgaca 

attttgaccg 

aaggtgccga 

gtggccgaga 

caggcacccc 

gtgagtaagg 

caccttcacg 

agtcagattc 

aactggttcg 

gagtgctaca 

actaacatgg* 

gcgcagcacc 

aattctgacg 

tggctggtgg 

tacatctcct 

gccggcaaga 

cccgcggaca 

gcctacgccg 

atctggctgt 

gccgtgccct 

gtcgacaaga 

gccaaggcca 

cagatcgacc 

gggaacagca 

ctcacccgcc 

ttccgctggg 

ggagccagca 

ccctcagtcg 

aggtaccaaa 



cctctatgcg 

gctctgctct 

tcactagggg 

tcatagggga 

ccacgtggcc 

cgaaatttga 

gcgacctgga 

aggaatggga 

tgaccgtggc 

ccccggaggc 

ttctggtgga 

gggagaagct 

cggtgaccaa 

tccccaacta 

aggagtatat 

tgacccacgt 

cgcccgtgat 

accggggcat 

tcaacgccgc 

tcatggcgct 

ttaaaaccaa 

gctccgtctt 

ttgggcccgc 

tctacggctg 

tggtgatctg 

ttctcggcgg 

ccacccccgt 

ccaccttcga 

gtctggagca 

ccagtgatca 

aaagacccgc 

cggatccatc 

acaaatgttc 



cgctcgctcg 

gccggcccca 

taccgcgaag 

gtggtcctgt 

atttgaggta 

acgagcagca 

cgagcacctg 

gctgcccccg 

cgagaagctg 

cctgttcttt 

gaccacgggg 

ggtccagacc 

gacgcgtaat 

cctcctgccc 

aagcgcgtgt 

cagccagacg 

caggtcaaaa 

cacctccgag 

ctccaactcg 

gaccaaatcc 

ccgcatctac 

tctcggctgg 

caccaccggc 

cgtcaactgg 

gtgggaggag 

cagcaaggtg 

gatcgtcacc 

gcaccagcag 

cgactttggc 

cgtgaccgag 

ccccgatgac 

gacgtcagac 

tcgtcacgcg 



ctcggtgggg 

ccgagcgagc 

cgcctcccac 

attagctgtc 

tatatggccg 

gccatgccgg 

ccgggcattt 

gattctgaca 

cagcgcgact 

gttcagttcg 

gtcaagtcca 

atctaccgcg 

ggcgccggcg 

aagacccagc 

ttgaacctgg 

caggagcaga 

acctccgcgc 

aagcagtgga 

cggtcccaga 

gcgcccgact 

cgcatcctgg 

gcccagaaaa 

aagaccaaca 

accaatgaga 

ggcaagatga 

cgcgtggacc 

tccaacacca 

ccgttgcagg 

aaggtgacga 

gtggcgcatg 

gcggatataa 

gcggaaggag 

ggcatgattc 



cctgcggacc 

gagcgcgcat 

gctgccgcgt 

acgtgagtgc 

agtgagcgag 

gtttctacga 

ctgactcgtt 

tggatctgaa 

tcctggtcca 

agaagggcga 

tggtgctagg 

gggtcgagcc 

gggggaacaa 

ccgagctgca 

ccgaacgcaa 

acaaggagaa 

gctacatgga 

tccaggagga 

tcaaggccgc 

acctggtggg 

agctgaacgg 

agttcgggaa 

ttgcggaagc 

actttccctt 

cggccaaggt 

aaaagtgcaa 

acatgtgcgc 

accggatgtt 

agcaggaagt 

agttctacgt 

gcgagcccaa 

ctccggtgga 

agatgctgtt 



aaaggtccgc 

agagggagtg 

cagcgctgac 

ttttgcgaca 

caggatctcc 

gatcgtgatc 

tgtgaactgg 

tctgatcgag 

atggcgccgc 

gagctacttc 

ccgcttcctg 

cacgctgccc 

ggtggtggac 

gtgggcgtgg 

acggctcgtg 

tctgaacccc 

gctggtcggg 

ccaggcctcg 

gctggacaat 

gccctcgctg 

gtacgatcct 

gcgcaacacc 

catcgcccac 

caacgattgc 

cgtggagtcc 

gtcgtccgcc 

cgtgattgac 

caaatttgaa 

caaagagttc 

cagaaagggc 

gcgggcctgc 

ctttgccgac 

tccctgcaaa 



60 
120 
169 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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acgtgcgaga 
ttagagtgtt 
aaactctgcg 
gacctggtca 
tatggctgcc 
cgagtggtgg 
caacggccgg 
caagggggag 
ccagcagctc 
tcaggagcgt 
ggccaagaag 
tgcaaagaag 
cggcaagaaa 
agagtcagtc 
atctggtaca 
cggagtgggt 
cattaccacc 
aatctccagt 
ctgggggtat 
actcatcaac 
ccaggtcaag 
cacgattcag 
ccagggctgc 
gactctcaac 
cccctctcag 
gcctttccac 
cgaccagtac 
tcgggaactg 
gttacctgga 
cagcaacttt 
taatcccggc 
cggagtcctg 
aatgacaaat 
agtcagcagc 
gggagcctta 
ggccaagatt 
acttaaacat 
ggaggtgttt 
cagcgtggaa 
tcagtacacc 
tgtttactct 
tgttaatcaa 
cttatcggtt 
aacactgacg 
cgctcgctcg 
gccccaccga 



gaatgaatca 

tccccggcgt 

cgattcatca 

acgtggacct 

gatggttatc 

gacctgaaac 

ggtctggtgc 

cccgtcaacg 

aaagcgggtg 

ctgcaagaag 

cgggttctcg 

agaccggtag 

ggccagcagc 

cccgaccctc 

gtggctgcag 

aatgcctcag 

agcacccgaa 

gaaactgcag 

tttgacttta 

aacaactggg 

gaggtcacga 

gtattctcgg 

ctgcctccgt 

aatggcagtc 

atgctgagaa 

agcagctacg 

ttgtactacc 

cagttttacc 

ccttgcttcc 

gcttggactg 

gtcgccatgg 

atttttggaa 

gaagaagaaa 

aacttacaag 

cctggcatgg 

cctcacacgg 

ccgcctcctc 

actcctgcca 

atcgagtggg 

tccaactttg 

gagcctcgcc 

taaaccggtt 

tccatagcaa 

tcaccgcggt 

gtggggcctg 

gcgagcgagc 



gaatttcaac 
gtcagaatct 
tctgctgggg 
ggacgactgc 
ttccagattg 
ctggagcccc 
ttcctggcta 
cggcggacgc 
acaatccgta 
atacgtcatt 
aacctctcgg 
agccgtcacc 
ccgccagaaa 
aacctctcgg 
gcggtggcgc 
gaaattggca 
cctgggccct 
gtagtaccaa 
acagattcca 
gattccggcc 
cgaatgacgg 
actcggaata 
tcccggcgga 
agtctgtggg 
cgggcaacaa 
cacacagcca 
tggccagaac 
agggcgggcc 
ggcaacaaag 
gtgccaccaa 
caactcacaa 
aaactggagc 
ttcgtcctac 
cggctaatac 
tctggcagaa 
atggcaactt 
agatcctgat 
agtttgcttc 
agctgcagaa 
aaaagcagac 
ctattggcac 
gattcgtttc 
ctggttacac 
acccctagtg 
cggaccaaag 
gcgcatagag 



atttgcttca 
caaccggtcg 
cgggcgcccg 
gtttctgagc 
gctcgaggac 
gaaacccaaa 
caagtacctc 
agcggccctc 
cctgcggtat 
tgggggcaac 
tctggttgag 
tcagcgttcc 
gagactcaat 
agaacctcca 
accaatggca 
ttgcgattcc 
gcccacctac 
cgacaacacc 
ctgccacttc 
caagaagctg 
cgttacgacc 
ccagctgccg 
cgtcttcatg 
acgttcctcc 
ctttgagttc 
gagcctggac 
acagagtaac 
ttcaactatg 
agtctccaaa 
atatcacctg 
ggacgacgag 
aactaacaaa 
taatcctgta 
tgcagcccag 
ccgggacgtg 
tcacccgtct 
caagaacact 
gttcatcaca 
ggaaaacagc 
tggtgtggac 
tcgttacctc 
agttgaactt 
attaactgct 
atggagttgg 
gtccgcagac 
ggagtggcca 



cacacggggt 

tcagaaaaaa 

agattgcttg 

aataaatgac 

aacctctctg 

gccaaccagc 

ggacccttca 

gagcacgaca 

aaccacgccg 

ctcgggcgag 

gaaggcgcta 

cccgactcct 

ttcggtcaga 

gcagcgccct 

gacaataacg 

acatggctgg 

aacaaccacc 

tacttcggct 

tcaccacgtg 

cggttcaagc 

atcgctaata 

tacgtcctcg 

attcctcagt 

ttctactgcc 

agctacagct 

cggctgatga 

ccaggaggca 

gccgaacaag 

acgctggatc 

aacggcagaa 

gaccgctttt 

actacattgg 

gccacggaag 

acacaagttg 

tacctgcagg 

cctttgatgg 

cccgttcccg 

cagtacagca 

aagcgctgga 

tttgccgttg 

acccgtaatc 

tggtctcctg 

tgggtgcgct 

ccactccctc 

ggcagagctc 



cagagactgt 
gacgtatcgg 
ctcggcctgc 
ttaaaccagg 
agggcattcg 
aaaagcagga 
acggactcga 
aggcctacga 
acgccgagtt 
cagtcttcca 
agacggctcc 
ccacgggcat 
ctggcgactc 
ctagtgtggg 
aaggtgccga 
gcgacagagt 
tctacaagca 
acagcacccc 
actggcagcg 
tcttcaacat 
accttaccag 
gctctgcgca 
acggctacct 
tggagtactt 
tcgaggacgt 
atcccctcat 
cagctggcaa 
ccaagaattg 
aaaacaacaa 
actcgttggt 
tcccatccag 
aaaatgtgtt 
aatacgggat 
tcaacaacca 
gtcccatctg 
gcggctttgg 
ctaatcctcc 
ccggacaagt 
acccggagat 
acagccaggg 
tgtaattgca 
tgcttcttat 
tcacgataag 
tatgcgcgct 
tgctctgccg 



<210> 65 
<211> 623 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 65 

Met Pro Gly Phe Tyr Glu lie val lie Lys Val Pro ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie Ser Asp Ser Phe val Asn Trp val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 
35 40 45 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4721 
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Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 _ 

val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His Val Leu val Glu 

85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 

100 105 110 

Arq Glu Lys Leu Val Gin Thr lie Tyr Arg Gly val Glu Pro Thr Leu 

115 120 125 

Pro Asn Trp Phe Ala val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 

165 170 175 ( 

ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu val Ala Gin His 

180 185 190 

Leu Thr His val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 

195 200 205 

Pro Asn ser Asp Ala Pro val He Arg Ser Lys Thr Ser Ala Arg Tyr 

210 215 220 

Met Glu Leu val Gly Trp Leu val Asp Arg Gly lie Thr Ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie ser Phe Asn Ala Ala 

245 250 255 

ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 

260 265 270 

He Met Ala Leu Thr Lys ser Ala Pro Asp Tyr Leu Val Gly Pro ser 

275 280 285 

Leu pro Ala Asp lie Lys Thr Asn Arg lie Tyr Arg He Leu Glu Leu 

290 295 300 

Asn Gly Tyr Asp Pro Ala Tyr Ala Gly ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 

325 330 ^ 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala val Pro 

340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 n 365 

cys Val Asp Lys Met val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

Lys val val Glu ser Ala Lys Ala He Leu Gly Gly ser Lys val Arg 
385 390 395 400 

val Asp Gin Lys cys Lys ser Ser Ala Gin He Asp Pro Thr Pro Val 

405 410 415 

lie Val Thr ser Asn Thr Asn Met cys Ala Val He Asp Gly Asn ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

450 455 460 

Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 470 475 480 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala ser Lys Arg Pro Ala 

485 490 495 

Pro Asp Asp Ala Asp He ser Glu Pro Lys Arg Ala cys Pro ser val 

500 505 510 

Ala asp Pro ser Thr ser Asp Ala Glu Gly Ala pro val Asp Phe Ala 

515 520 525 

Asp Arg Tyr Gin Asn Lys cys ser Arg His Ala Gly Met lie Gin Met 

530 535 540 

Leu Phe Pro Cys Lys Thr cys Glu Arg Met Asn Gin Asn Phe Asn lie 
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545 550 555 , n 560 

Cvs Phe Thr His Gly val Arg Asp cys Leu Glu cys Phe Pro Gly Val 

565 " 570 575 

Ser Glu Ser Gin Pro Val val Arg Lys Lys Thr Tyr Arg Lys Leu Cys 

580 585 590 

Ala lie His His Leu Leu Gly Arg Ala Pro Glu He Ala Cys ser Ala 

595 600 605 

Cys Asp Leu val Asn val Asp Leu Asp Asp Cys val Ser Glu Gin 

610 615 620 

<210> 66 
<211> 737 
<212> PRT 

<213> Artificial sequence 

<220> r (J! . . . 

<223> Description of Artificial sequence; note = 
synthetic construct 

Met°Ala 6 Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 , oU 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr ser Phe Gly Gly 

100 105 „ HO 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 

115 120 125 

Leu Gly Leu val Glu Glu. Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 

130 135 140 

Pro val Glu Pro ser Pro Gin Arg Ser pro Asp Ser Ser Thr Gly lie 
145 150 155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 

165 170 n 175 

Thr Gly Asp ser Glu ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 

180 185 190 

Pro Ala Ala Pro ser ser val Gly ser Gly Thr val Ala Ala Gly Gly 

195 200 205 

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn 

210 215 220 

Ala Ser Gly Asn Trp His cys Asp ser Thr Trp Leu Gly Asp Arg val 
225 230 235 *w 

He Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 

245 250 255 

Leu Tyr Lys Gin He Ser Ser Glu Thr Ala Gly ser Thr Asn Asp Asn 

260 265 270 

Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 

275 280 n 285 

Phe His Cys His Phe ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 

290 295 300 

Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie 
305 310 315 320 

Gin val Lys Glu Val Thr Thr Asn Asp Gly val Thr Thr lie Ala Asn 

325 330 335 

Asn Leu Thr Ser Thr He Gin val Phe Ser Asp Ser Glu Tyr Gin Leu 
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340 345 350 

pro Tyr val Leu Gly ser Ala His Gin Gly cys Leu Pro Pro Phe Pro 

355 360 3155 

Ala asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 

370 375 380 

Gly ser Gin Ser Val Gly Arg ser ser Phe Tyr cys Leu Glu Tyr Phe 
385 390 395 400 

pro ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe ser Tyr ser 

405 410 415 

Phe Glu Asp val Pro Phe His Ser Ser Tyr Ala His Ser Gin ser Leu 

420 425 430 

asd Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala 

W 435 440 445 

Ara Thr Gin Ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 
y 450 455 460 



Phe Tyr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 

470 475 480 

w - Phe Arg Gin Gin Arg val Ser Lys Thr Leu 
485 ~ 490 495 



465 ~ Tt 1 

Leu Pro Gly Pro cys Phe Arg Gin Gin Arg val Ser Lys Thr Leu Asp 
Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 



500 505 510 

Leu Asn Gly Arg Asn ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 

515 520 525 

His Lys Asp Asp Glu Asp Arg Phe Phe Pro ser ser Gly val Leu lie 

530 535 540 

Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn val Leu 
545 550 555 560 

Met Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu 

565 570 575 

Glu Tvr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 

580 585 , 590 

Gin Thr Gin Val val Asn Asn Gin Gly Ala Leu Pro Gly Met val Trp 

595 600 605 

Glh Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro 

610 615 620 

His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625 630 635 640 

Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro val Pro 

645 650 n 655 

Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala ser Phe lie 

660 665 n 670 

Thr Gin Tyr ser Thr Gly Gin val Ser val Glu He Glu Trp Glu Leu 

675 680 685 

Gin Lys Glu Asn ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr ser 

690 695 700 

Asn Phe Glu Lys Gin Thr Gly Val Asp Phe Ala val Asp ser Gin Gly 
705 710 715 720 

Val Tyr ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn 
725 730 735 

Leu 

<210> 67 
<211> 8 
<212> PRT 

<213> Artificial sequence 

<220> ..... 
<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 67 

Gly ser Ser Asn Ala ser Asp Thr 
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<210> 68 
<211> 14 
<212> PRT 

<213> Artificial sequence 

<220> . . , 

<223> Description of Artificial sequence; note - 
synthetic construct 

Thr°Thr 8 ser Gly Gly Thr Leu Asn Gin Gly Asn Ser Ala Thr 
1 5 10 

<210> 69 
<211> 6 
<212> PRT 

<213> Artificial Sequence 

<220> ...... 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 69 

Asn Gly Arg Ala His Ala 
1 5 

<210> 70 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 70 

Ser lie Gly Tyr Pro Leu Pro 
1 5 

<210> 71 
<211> 10 
<212> PRT 

<213> Artificial sequence 

<220> ..... 
<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 71 , u ,V 

Lys Phe Asn Lys Pro Phe Val Phe Leu lie 

<210> 72 
<211> 22 
<212> PRT 

<213> Artificial sequence 

<223> Description of Artificial sequence; note = 
synthetic construct 
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Asn°lle 2 ser Leu Asp Asn Pro Leu Glu Asn pro ser ser Leu Phe Asp 
1 5 10 

Leu val Ala Arg lie Lys 
20 
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